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A DISPROOF OF THE LAW OF EFFECT AND A 
SUBSTITUTION OF THE LAWS OF 
EMPHASIS, MOTIVATION AND 
DISRUPTION 


BY E. C. TOLMAN, C. S. HALL, AND E. P. BRETNALL 
University of California 


The present investigation was formulated with the idea 
of discovering whether or not the Law of Effect in the form in 
which it is usually stated by Thorndike (for example (2), p. 
36) necessarily and inevitably holds.!. The Law states that, 
in a trial and error situation, the right responses tend to get 
selected as against the wrong responses primarily because the 
right responses are followed by a good effect (1.¢., satisfiers) 
whereas the wrong responses tend to drop out because they are 
followed by bad effects (1.¢., annoyers). It was the aim of the 
present experiment to discover what would happen if an 
annoyer (electric shock) were applied not, as is usually done, 
after the wrong responses but after the right responses. We 
sought, that is, to compare the results for such an arrangement 
with those for the more usual set-up in which the annoyer 
(electric shock) was applied after the wrong responses. 

A pparatus.—The apparatus used was a device intended to 
be for human beings relatively comparable to a maze for rats. 


We have called it the ‘punchboard maze.’? It consisted of 
1A brief preliminary statement of the experiment has already appeared. See 
Tolman (3), pp. 344-345- 
2 Designed by Robert H. Bruce and hitherto unreported. 
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thirty pairs of holes arranged in an irregular spatial order on a 
board three feet square (see Fig. 1). The subject’s task was 
to go through the thirty pairs, punching one hole in each pair 
with a stylus (analogous to entering the blind or the true path 
section at each of thirty choice points in a maze) and thus to 
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Fic. 1. Punch Board Maze. The starred holes are those which for Groups I, II, III 
and IV sounded. The stars did not appear on the actual maze itself. 


learn for each pair which of the two holes was correct. The 
board was placed on an easel so that it sloped at a slight angle 
away from the subject and so that the bottom edge was 
twenty-one inches from the floor. Some of the subjects stood 
and some sat during the test performance. The board had 
60 holes and was covered with a facing of heavy wrapping 
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paper. The holes were three-quarters of an inch deep and 
were backed by uniform brass contact plates. The contact 
plates of one hole in each of the thirty pairs was connected 
with an electric circuit so that when the stylus was introduced 
it rang a buzzer or, in the case of some groups, gave an electric 
shock plus the ringing of the buzzer. 

Subjects and Procedure.—There were five groups of sub- 
jects. The conditions under which these different groups 
were tested were as follows: 

Groups I and II and Ia were the control groups. For 
them no shock but merely the electric buzzer was used. This 
buzzer was connected with one hole of each of the pairs of 
holes so that when the metal stylus was pushed into the hole 
against the contact plate the buzzer rang. 

Group I learned to avoid these holes which thus sounded 
the buzzer when the stylus was pushed into them. This 
group will be referred to as the Bell-Wrong group (BW). 

Group II learned to select the buzzing holes. This group 
will be referred to as the Bell-Right group (BR). 

Group Ia was a special control group. For it the wiring 
was reversed so that the buzzer sounded on the opposite hole 
of each pair from what it did for all the other groups. The 
subjects of Group la were like those of Group I in that they 
learned to avoid the buzzing holes (BWa) but the holes which 
group Ia avoided were the alternates of those avoided by 
Group I. 

Groups III and IV were the experimental groups. For 
them the electric shock was added to the buzzer. That is, in 
the set-up as used for these groups a second circuit was intro- 
duced which ran between the bare metal stylus as held by the 
subject in the one hand and an electrode held by him in his 
other, passive, hand. This second circuit was the secondary 
from an inductorium; and the wiring was so arranged that 
when the stylus made contact with the plate at the back of the 
hole it caused not only the buzzer to ring as for Groups I and 
II and Ia but in addition caused an electric shock to pass 


through the subject by way of the stylus in his one hand and 
the electrode in his other. 
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Group III was like Group I save for the addition of the 
electric shock. That is, the subjects in this group were re- 
quired to learn to avoid the hole in each pair which both buzzed 
and gave an electric shock. This group will be referred to as 
the Bell-Shock-Wrong group (BSW). 

Group IV was like Group II save for the addition of the 
electric shock. That is, the subjects in this group were re- 
quired to learn to select the hole in each pair which both buzzed 
and gave an electric shock. This group will be referred to as 
the Bell-Shock-Right group (BSR). 

All the subjects were students in the beginning class in 
psychology. The sizes of the groups and their distributions 
as to sex are shown in table I. 


TABLE 1 
DISTRIBUTION OF SUBJECTS 














Men Women Both 

ss caee ence essences 29 17 46 

EE, os a ean cence ee eeennes 31 16 47 
III. pene e seaae cna aii ie deka 17 17 34 
BV. BPCOCEIRINE So occ ccc ce cc ccccees 17 17 34 
TE iin km an tm 00g Kae OA die 12 13 25 
ims weeehesenie aks as ee ens 106 80 186 














The instructions given to all the subjects were as follows: 
“This is a learning experiment. There are thirty pairs of 
holes on this board numbered in order from one to thirty. 
One of each of the pairs of holes rings a buzzer (in the case of 
Groups III and IV—rings and shocks) and the other hole does 
not. I want you to place this stylus in each pair of holes from 
one to thirty. You will have three trials in which you are to 
hit each hole of each pair. After you have done this, I will 
tell you whether to learn to hit the ringing or non-ringing 
series (the ringing-shocking or non-ringing, non-shocking 
series). You may begin.” 

The three trials, during which the subject hit all the holes 
in order, constituted the fore-practice. 

After the subject had finished these first three trials, he 
was then told which series he should learn. He was further 
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told to correct himself when he made an error by placing the 
stylus in the right hole of the pair. 

When shock was applied, the subject was asked to regulate 
the inductorium to give the maximum amount of electric 
current which he felt he could stand. An attempt to offset 
the factor of adaptation was made by increasing the strength 
after each trial. 

One correct trial was used as the criterion of learning. 
In a preliminary experiment it was found that a criterion of 
two correct trials did not affect the results appreciably. If 
the criterion had not been satisfied by the end of the twentieth 
trial, the experiment was discontinued. The experiment 
required about a half-hour of continuous performance. 











TABLE 2 
Mean Errors 
Trial Group Ia Group I Group II Group III Group IV 

BWa BW BR BSW BSR 

eevee 11.96 II.9I 11.72 13.41 11.18 
ise aera nd 11.96 12.19 11.09 12.26 10.23 
eee 11.12 11.04 10.49 12.18 10.26 
Dike ta scons 10.56 10.85 8.91 11.70 9.73 
eee 9.64 10.41 8.45 10.20 9.18 
iindidneen 7.96 8.41 6.81 8.91 7.73 
| eer 8.20 7.96 6.34 8.12 6.91 
Dessvaeseos 6.56 6.87 5-57 7.91 6.15 
ke uaaned 5-56 5-93 5.40 7.18 5-03 
issakdccan 4-72 6.00 4.91 5-79 4.35 
ae 3.48 4.61 4.09 5-59 4.09 
re 3.24 4.00 3.57 4.97 3.97 
eer 3.24 3.28 3.02 4.59 3-12 
rr ere 2.56 2.83 2.79 3.59 2.79 
Piet excesee 2.08 2.52 2.21 2.82 2.32 
eee ee 1.56 2.09 2.28 2.47 2.06 
are 1.00 2.15 2.04 2.41 2.15 
Pe saet saaace 1.32 1.61 1.68 1.79 1.56 
errr 1.08 1.39 1.45 1.59 1.50 
ee 84 1.13 1.04 1.32 1.15 




















Results—Table 2 presents the mean numbers of errors 
in each group for each of the 20 trials. And figure 2 presents 
this same data in the form of cumulative error curves ?* for 
each of the four main groups. Considering these latter it 
appears at once that the Bell-Right and the Bell-Shock-Right 


* For the use of cumulative curves as a desirable method of ‘smoothing’ we are 
indebted to L. L. Thurstone. 
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curves are better (not worse) than the Bell-Wrong and the 
Bell-Shock-Wrong curves. It also appears that the difference 
between Bell-Shock-Right curve and the Bell-Shock-Wrong 
curve is greater than that between the simple Bell-Right curve 
and the simple Bell-Wrong curve. And, be it noted, all these 
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facts are contrary to expectation according to any ordinary 
formulation of the Law of Effect. In fact the only one of the 
results which suggests any comfort to this usual law is the 
fact that the Bell-Shock-Right curve is perhaps a little bit 
worse than the simple Bell-Right curve. In order to be cer- 
tain of the significances of all these differences we must, how- 
ever, consider the statistical values, as given in Tables 3 and 4. 

From these tables it seems clear that all the differences are 
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TABLE 3 


Means, STANDARD DEVIATIONS, AND ProBABLE Errors OF THE MEANS FOR THE 
Four Matin Groups 






























































Bell-Wrong (1) Bell-Right (11) 

Trials 
Mean $i. P.B.w Mean S$. Dp. P.B.o 
ade OCCT Oe 56.41 12.00 1.19 50.62 14.52 1.43 
nr 35.17 17.87 1.78 29.30 16.45 1.62 
II-I5 17.19 14.63 1.45 15.66 14.19 1.39 
56-00....... 8.37 10.82 1.08 8.43 11.13 1.09 
a eee 117.22 48.72 4.85 103.81 49.66 4.89 

Bell-Shock-Wrong (III) Bell-Shock-Right (1V) 
Trials 
Mean S.D. P.E.m Mean S.D. P.E 
I- §.. 59-76 11.33 1.31 50.50 12.24 1.42 
On eins 37-91 15.08 1.75 30.18 15.88 1.84 
| eee 21.50 16.14 1.87 * 16.29 14.73 1.70 
ae 9.59 11.99 1.39 8.47 11.19 1.29 
PR cccce 128.82 45-99 5-32 105.7I 48.44 5.60 
TABLE 4 


CoMPARISON OF GrRouPS WITH ONE ANOTHER 


Diff. = difference between means of groups; P.E.gi¢t, = probable error of differ- 












































ence; C.R. = critical ratio or difference divided by P.E.git¢.; P = chances in 100 of 
such a difference being a true difference and not one due to chance. 
Bell-Wrong (I) minus Bell-Shock-Wrong (III) minus 
Bell-Right (II) Bell-Shock-Right (IV) 
Trials 
Diff. | P.E.gig | C-R. P Diff. | P.E.qig | C.R. P 

I- §..| 5.79 1.86 3.11 96 9.17 1.93 4-75 100 
6-10..| 5.87 2.40 2.44 go 7.73 2.53 3.05 96 
II-I5..| 1.53 2.02 76 39 5.21 2.53 2.06 83 
16-20..} — 1.53 — .04 3 1.12 1.90 59 31 

I-20..] 13.41 6.88 1.95 81 23.11 7.73 2.99 96 

Bell-Shock-Wrong (III) minus Bell-Shock-Right (IV) minus 
Bell-Wrong (1) Bell-Right (11) 
Trials 
Diff. | P.E.qig | C-R. P Diff P.E.gig, | C.R. P 

Im §.. 3-35 1.77 1.89 80 — .03 2.01 —.O1 00 
6-10..| 2.74 2.49 1.10 54 88 2.45 36 19 
II-I5..| 4.31 2.37 1.82 78 63 2.20 .29 16 
16-20..| 1.22 1.75 .69 36 .04 1.70 .02 I 

I-20..| 11.60 7.19 1.61 72 1.90 7.43 25 13 
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probably significant save that between the simple Bell-Right 
curve and the Bell-Shock-Right curve. Bell on the wrong 
holes makes it more difficult to learn the total task than bell 
on the right holes. And, adding shock to the wrong holes in 
addition to bell makes the total task even more difficult. Bell 
on the right holes and bell plus shock on the right holes make 
the total task easier—but there is no very clear difference 
between bell alone and bell plus shock. 

Before, however, attempting any final interpretation of the 
above results we must consider the results of Group Ia. For 
it must be noted that where the Bell-Wrong and the Bell- 
Shock-Wrong groups learned one pattern of holes (the un- 
starred series in Fig. 1), the Bell-Right and the Bell-Shock- 
Right learned the other pattern of holes (the starred series in 
Fig. 1). And it might be that the obtained differences, that 
is, the general superiority of the Bell-Right and the Bell- 
Shock-Right groups over the other two, was merely due to 
the one pattern being more difficult than the other. It was 
to test this out that Group Ia was run. For this group the 
holes unstarred (Fig. 1) were connected with the buzzer. 
In other words, Group Ia like Group I were a Bell-Wrong 
group but actually they were required to select the opposite 
set of holes from those of Group I—that is, to select the same 
absolute holes as those selected by Group II and Group IV, 
the Bell-Right and the Bell-Shock-Right groups. Group II 
and Group IV should therefore also be compared with this 
Group Ia to make sure that the differences we found between 
them and Group I were not meretricious and due merely to 
differences in the pattern of holes as such. In Fig. 3 we show 
the cumulative curves for Group I, Group Ia, and Groups II 
and IV, and in Tables 5 and 6 we show the statistical values 
for this group and for this group compared with Group I and 
Groups II and IV. 

It appears from these tables and from the curves in Fig. 3 
that Group Ia was somewhat better than Group I but not as 
much better as were Group II or Group IV. We conclude 
that the Bell-Right and Bell-Shock-Right were in all proba- 
bility really better than Bell-Wrong and really even still better 
han Bell-Shock-Wrong. 
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Before turning to our final conclusions and interpretations 
we may note the high reliabilities of the individual differences 
within each of the four main groups. These reliabilities are 
presented in Table 7. The coefficients were computed by 
correlating the errors on the first five odd trials with the errors 
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on the first five even trials, and estimating the reliability of all 
ten trials by means of the Spearman-Brown formula. The 
coefficients are all over .go, indicating that whatever ability is 
measured by performance on this task is being measured con- 
sistently throughout the ten trials. 

Inter pretations.—We turn now to the question of interpre- 
tations. 

First, let us seek to explain the effects of just the bell 
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TABLE 5 


MEAN, STANDARD DEVIATION, AND PROBABLE ERROR OF THE MEAN FOR THE BELL- 
Wronc Group (Ia) wuicn LEARNED THE PATTERN OpposiTE 
THAT OF THE BeLt-Wronc Group (I) 
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Bell-Wrong (Ia) 










































































Trials 
Mean S.D. P.E.m 
Se 55-24 11.42 1.54 
i Cbudenreeacees 33.08 13.81 1.86 
ere 14.60 15.81 2.13 
6-46.04 a eee ee 5.80 11.69 1.58 
I-20 108.64 45.84 6.18 
TABLE 6 
Comparison OF Group Ia witH Eacu oF THE OTHERS 
(See Table 3 for explanation of column headings) 
Bell-Wrong (I) minus Bell-Wrong (Ia) minus Bell-Wrong (Ia) minus 
Bell-Wrong (Ia) Bell-Right (11) Bell-Shock-Right (IV) 

Trials 

Diff. | P-E.gig | C-R.| P | Diff. | P-E-gig | C-R. | P | Diff. | P-E.gig | C.R. | P 
I- §.| 1.17] 1.95 .60|31}] 4.62| 2.10 2.20|86] 4.74] 2.09 2.27 | 88 
6-10.| 2.09] 2.57 81/41] 3.78| 2.47 1.53/69] 2.40| 2.62 1.11 | 54 
II-15.|2.59| 2.58 | 1.00|50]—1.06| 2.54 |— .42]21]—1.69] 2.73 |— .62]31 
16-20.| 2.57] 1.91 | 1.35 |64]—2.63| 1.92 |—1.37|64]—2.67] 2.04 |—1.31] 62 
1-20.|}8.58| 7.86 | 1.09/54] 4.83] 7.88 61/31] 2.93] 8.34 35/19 

TABLE 7 


RELIABILITY COEFFICIENTS FOR THE Four Main Groups 


Comparable Scores Gotten by Taking First Five Odd Trials Against First Five Even Trials 








Reliability Coefficients 








Groups Correlation Between Reliability Coefficient 
n First 5 Odd and of First ro Trials 
First 5 Even Trials (Brown-Spearman Formula) 
Bell-Wrong............. 46 .906 951 
Bell-Right..............] 47 941 970 
Bell-Shock-Wrong....... 34 854 921 
Bell-Shock-Right........ 32 9OI .948 














alone. For it appears quite definitely that the bell simply 
by itself when it occurs on the right responses and is the signal 
for those responses makes learning easier than when it occurs 
on, and is the signal for, the wrong responses. But bell alone 
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seems hardly a matter of either reward or punishment. 
Rather it seems to be a signal or emphasizer. It signals to 
the subject that the response just made is right (or vice versa 
that it is wrong) and it thereby also emphasizes the one hole 
or the other. And, turning now to the actual outcome, it 
appears that when this signal or emphasizer is on the correct 
hole it is easier for the subject to learn than when it is on the 
incorrect hole. And, if we may envisage learning as consist- 
ing in the building up of little subordinate gestalts in which 
the one hole of each pair is to be set off against the other 
hole of the pair,’ this is perhaps just what we might expect. 
For it would seem quite reasonable that if the positive em- 
phasis, or accent, is on the to-be-selected hole, learning should 
be easier than when this positive emphasis or accent is on the 
to-be-avoided hole. Indeed it would seem that the present 
finding is really in line with that obtained by Thorndike (2), 
p. 45, when he discovered that it aided learning more to tell a 
subject when he was right than to tell him when he was wrong. 
It does not seem to us, in short, reasonable to interpret either 
bell or the mere words ‘right’ and ‘wrong’ as rewards or 
punishments. The true rewards and punishments—satisfiers 
and annoyers—are the ultimate ‘ getting-to’ or ‘not-getting-to’ 
the end of the task and the getting there successfully. The 
effect of the bell (or of the word ‘right’ or ‘ wrong’) is merely to 
emphasize. And our finding is that an emphasis upon success 
is more helpful than an emphasis on lack of success. The 
emphasis upon the latter would seem to act, perhaps, as a bale- 
ful fascination which attracts the performer and which makes 
the avoiding of the response in question the more difficult. 
In the second place, we must seek to consider the results 
when the shock was added to the bell. Now the shock was 
something of a punishment in and of itself. It acted as a 
motivator. And, if this were all, then the group who received 
the shock upon the wrong responses (1.¢., Group III), the Bell- 
Shock-Wrong group, should have learned faster than the 
corresponding group who did not receive shock (1.¢., Group I), 


‘For an elaboration of the present argument see Tolman (3), Chapters XXII 
and XXIII especially, 344-346 and 362-367. 
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the simple Bell-Wrong group, since learning in Group III 
would be in accord not only with the motive for general suc- 
cess but also in accord with that for an avoidance of shocks. 
Actually, however, the Bell-Shock-Wrong group, Group III, 
learned more slowly than the Bell-Wrong group, Group I. 

Now one way that we can explain this would seem to be 
to point out that shock is also an additional emphasizer and 
that, as such an additional emphasizer on the wrong responses, 
it would work against rather than for more rapid learning. Its 
emphasizing effect here tended to counteract its effect as 
something in and of itself to be avoided. 

Finally, however, let us consider the effect of the shock 
upon the right responses. As emphasis the shock on the right 
responses should according to our above reasoning have aided 
learning. As punishment in and of itself it should neither have 
aided nor hindered since, whether or not the wrong (and in this 
case unpunished) response was given first, the right and 
punished response always had to be given afterwards. Inother 
words the avoidance of learning did not, in this set-up, diminish 
the amount of shock in each trial. Indeed considering all the 
trials together it would appear that the fewer the trials the 
smaller the total amount of shock so that, if anything, the 
shock for this group also should have caused faster learning. 
Actually, however, we see that the Bell-Shock-Right group 
did not do consistently better than the Bell-Right. Neither, 
however, did it do appreciably worse. 

Still another hypothesis seems to be needed. It has sev- 
eral times been reported by other investigators that electric 
shock as such has a disrupting or distracting effect (1, 4). As 
such it would tend to hinder learning wherever it is intro- 
duced. Introduced after the wrong responses it would tend 
to cause distraction and to prevent them being remembered 
as wrong, and introduced after the right responses it would 
tend similarly to cause distraction and to interfere with their 
being remembered as right. In short, on this hypothesis, this 
disruptive effect of the shock would, when added to the wrong 
responses, work in the same direction as its emphasizing effect 
and further hinder learning. And this would help to explain 
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the decidedly greater poorness of the Bell-Shock-Wrong group 
compared with the Bell-Wrong group. And when added to 
the right responses this disruption effect of the shock would 
work against its emphasizing effect. And this would explain 
why the results for the Bell-Shock-Right group are not clearly 
worse or better than those for the Bell-Right group. 

Conclusion.—In conclusion it may be well to note that 
actually we have been led to suggest three laws of learning as a 
substitute for and correction of Thorndike’s law of effect. 
These would be a Law of Motivation, a Law of Emphasis and 
a Law of Disruption. 

By a Law of Motivation we would mean the assumption 
that the ultimate reason for learning seems to be the obtaining 
of final successes or of final failures. (And, insofar as the 
avoidance of shocks is in and of itself a potent goal, then the 
introduction of shock to the wrong responses will favor learn- 
ing.) 

By a Law of Emphasis we would mean the assumption 
that in a trial and error situation learning consists in building 
up little patterns or gestalts in which the correct response is 
set Over against the incorrect one or ones and the further as- 
sumption that an accent or emphasis (such as bell or shock) 
upon the correct responses will tend to favor learning whereas 
an accent or emphasis upon the wrong responses will tend to 
hinder learning. 

By a Law of Disruption we would mean the assumption 
that any relatively violent negative emotional stimulus (such 
as an electric shock) coming in immediate sequence upon either 
the right or the wrong responses will, as such, tend to disrupt 
learning. 

Finally we may indicate once again the application of all 
three of these laws to our actual findings. 

The Bell-Wrong group should be worse than the Bell-Right 
group by virtue of the Law of Emphasis and this was the ac- 
tual outcome. 

The Bell-Shock-Wrong group when compared with the 
Bell-Wrong group should be worse than the latter by virtue 
of both the Law of Emphasis and the Law of Disruption. It 
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should be better than the latter by virtue of the Law of Motiv- 
ation. Actually it was worse. 

The Bell-Shock-Right group should be better than the 
Bell-Right group by virtue of the Law of Emphasis and pos- 
sibly also a little bit better by virtue of the Law of Motivation. 
It should, however, be worse than the Bell-Right group by 
virtue of the Law of Disruption. Actually it was about the 
same as the simple Bell-Right group. 

One last word—the present experiment and this final 
discussion are presented for their suggestive value and not in 
any belief that they are definite or final. Our three ‘laws’ are 
mere tentative hypotheses to be further investigated. 
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THE PREDICTION OF VOCATIONAL APTITUDE AND 
SUCCESS FROM PHOTOGRAPHS! 


BY MORRIS S. VITELES AND KINSLEY R. SMITH 


University of Pennsylvania 


OBJECTIVES 


The objectives of the study described in this report were: 
(1) To investigate the accuracy of judgments on vocational 
aptitude and success or failure made by experienced 
personnel workers, from an examination of photo- 
graphs. 

(2) To determine the degree of certainty with which such 
judgments are made. 

(3) To analyze the specific physical and other signs employed 
when making such judgments. 

(4) To compare the judgments and degree of certainty of 
experienced personnel workers with those of observers 
inexperienced in the selection and management of 
workers. 


SUBJECTS AND PROCEDURE 
(A) The Subjects 


The procedure employed in this investigation followed that 
used by Landis and Phelps in a study published in 1928.? 
The only important difference is in the choice of subjects. 
The subjects of the investigation by Landis and Phelps were col- 
lege students. Inthe present study judgments were made by 
twenty-four members of a personnel association of a large 
Eastern city. The majority of the subjects are employment 
managers or interviewers in industrial plants. A few are 
employed in placement work in public and private employ- 


1 Paper read at the annual meeting of the American Psychological Association 
Toronto, Sept. 12, 1931. 

? Landis, C. and Phelps L. W., ‘The Prediction from Photographs of Success and 
of Vocational Aptitude,’ this JournaL, 1928, 11, 313-24. See this article for review 
of earlier studies. 
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ment agencies. Practically all have had occasion to use 
photographs in making judgment on personality in connection 
with employment or placement. Nearly every member of the 
group is acquainted with one or another physiognomic system 
of character analysis. Approximately ten of this group of 
personnel workers have received formal training in physiog- 
nomic systems of character analysis, and three or four make 
strict applications of such systems in their daily work. 

In the opinion of the authors, the most significant feature 
of this experiment is the opportunity which it allows for a 
direct comparison of the judgments of observers experienced 
in personnel work with those of subjects untrained in the art 
of selecting and placing workers, and without experience in 


this field. 
(B) The Material 


The material employed in this experiment was obtained through the courtesy of 
Dr. Carney Landis. It consists of two photographs of five successful and five un- 
successful men in each of the fields of law, medicine, education and engineering, making 
a total of twenty successful and twenty unsuccessful men. Landis and Phelps describe 
as follows the method and criteria employed in the selection of the photographs. 

“An alumni publication of a large American university was secured. This 
publication was issued on the 25th anniversary of the graduation of a class of 850 men 
and it contained an autobiographical sketch of each man together with his photograph 
at the time of graduation and at his 25th anniversary of graduation. For the purposes of 
this study, all autobiographies of men who had gone into law, medicine, education 
and engineering were read and assigned a rating from 1 to 10 on the basis of their 
apparent wordly success. For instance, a lawyer who had risen to be one of the 
chief attorneys for the Standard Oil Company was judged relatively successful, while 
one who was acting as clerk in a large law office was relatively unsuccessful. A man 
who had taught for 25 years in the common schools was judged relatively unsuccessful, 
while one who had risen to the superintendency of schools in a large city was judged 
relatively successful. In this study it must be borne in mind that success or non- 
success relates only to the vocation and is always a relative term—relative to the 
performance of other members of his own college class. 

“From the ratings thus assigned the five most successful and the five least success- 
ful in each of the four groups, law, medicine, education and engineering were taken. 
The published portraits were rephotographed and mounted on separate cards.” 


Landis and Phelps point out the advantage of this method 
of selecting photographs for use in an experimental investiga- 
tion of predicting success or failure from photographs. The 
selection was made not on the basis of the appearance of 
success or failure, but on the basis of an autobiography which 
gives a clear picture of how successful each individual has been 
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in the period of 25 years subsequent to graduation from 
college. In this respect, as Landis and Phelps point out, the 
investigation differs from earlier studies, such as those of 
Gaskill, Fenton and Porter * and of Pintner,‘ in which photo- 
graphs were not selected on the basis of complete factual 


evidence with respect to the ‘strength’ of the trait under 
consideration. 


(C) Procedure 


The photographs were divided into two groups,—one of ‘younger’ men, including 
the photographs taken at the time of graduation, and one of ‘older’ men, including 
the photographs taken at the 25th anniversary of graduation. ‘The forty photographs 
in each group,—twenty of successful and twenty of unsuccessful men,—were scattered 
in random order with respect to success and with respect to actual vocation. The 
photographs in each group were projected one at a time upon a screen. The photo- 
graphs of the ‘successful’ and ‘unsuccessful’ or ‘failure’ men, taken at the time they 
were college seniors, were presented first. Shortly afterwards the pictures of the 
‘older’? men were shown. However, the subjects were not informed that the two 
groups of pictures were of the same men. 

Each subject was given two blanks on which to record his judgments. Directions 
on the prepared blanks were read aloud while the observers followed them from their 
individual copies. The blank used by the observers in recording judgments in the 
case of the younger men bore the following instructions: 

“This experiment is designed to test your ability to predict from photographs 
the relative vocational success or failure of college seniors, and your ability to predict 
what profession, vocation, craft, or trade these men will follow. 

“T will project singly on the screen the photographs of 40 young men. The 
photographs of these young men were taken when the men were seniors in college 
some twenty-five years ago. Certain of these pictures are of men who will be very 
successful in the professions which they follow. Others of the pictures are of men 
who will be relatively unsuccessful in their professions. Note then that there are two 
groups, one of very successful men and one of relatively unsuccessful men. 

“Look at each picture in turn and then record your judgment in the appropriate 
column. 

“In the column headed ‘Success or Failure’ place an ‘S’ or ‘F.’ ‘S’ signifies 
that you believe that the man will be relatively successful in whatever vocation he 
may follow. ‘F’ means that he will be relatively unsuccessful. 

“In the next column record the ‘Percent Certain’ of this judgment. o percent 
means a hunch or no feeling of certainty whatsoever; 50 percent moderately certain; 
and 100 percent means positive conviction. 

“In the column headed ‘Vocation’ enter the number from the card (Form C A 1) 
indicating profession, trade, etc., which you would recommend. 

“In the column headed ‘Basis of Judgment’ enter the number from the card 
(Form C A 1) showing the reason best describing your basis of judgment.” 





3 Gaskill, P. C., Fenton, N., and Porter, J. P., ‘Judging the Intelligence of Boys 
from Their Photographs,’ J. Appl. Psychol., 1927, 11, 394-404. 
‘ Pintner, R., ‘Intelligence as Estimated from Photographs,’ Psychol. Rev., 1918, 


25, 286-96. 
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Below the instructions were six columns headed as follows: 


Picture Success Percent Vocation Basis of Remarks 
Number or Certain Judgment 
Failure 








There was a short intermission between the projection of the pictures of the 
‘younger’ and of the ‘older’ groups. As indicated above, the observers were not 
told that the two groups of pictures were of the same men. The blank and the in- 
structions used in obtaining the judgments on the photographs of the older men were 
largely the same as those employed in the case of the younger men, but the observers 
were instructed to make a judgment on whether the man had been successful, and to 
indicate the profession which he had followed. The first two paragraphs on the blank 
used in obtaining judgments on photographs of the older men read as follows: 

“This section of the experiment is designed to test your ability to judge experi- 
mentally from photographs what vocation, profession, or trade men have followed for 
25 years, and whether they have been very successful or relatively unsuccessful. 

‘I will project singly on the screen the photographs of 40 men. These men had 
been graduated from college some 25 years at the time the photographs were taken. 
Certain of these pictures are of men who have been very successful in the profession 
which they have followed. Others of the pictures are of men who have been relatively 
unsuccessful in their professions. Note then that there are two groups, one of very 
successful men and one of relatively unsuccessful men.” 

Each subject was handed a sheet (Form C A 1) showing a classification of possible 
vocations, and a list of ‘reasons for judgments.’ A section of the list of ‘possible 
vocations’ and the entire list of ‘reasons for judgments’ are reproduced below. 


FormCAt1 


PossiBLE VocaTIONs REASONS FOR JUDGMENTS 


1. Law 1. Shape of face 
1A General practice 





1B Politics 
2. Business 
2A Wholesale merchandising 
2B Retail merchandising 
2C Finance 
2C 1 Banking 
2C 2 Brokerage 
2C 3 Insurance 
3. Education 
3A College instruction 
3B College president 
4. Intellectual vocations other 
teaching 
4A Clergyman 
4B Scientist 
4C Inventor 
4D Author 
4E Poet 


2. Appearance of eyes 

3. Length of nose 

4. Shape or size of mouth 

5. Width of forehead 

6. Height of forehead 

7. Shape or size of ears 

8. Condition (neatness, etc.) 
9. Facial expression 


10. Position of head 
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Form C A 1—continued 


PossiBLE VOCATIONS REASONS FOR JUDGMENTS 


5. Engineering 11. Similarity to some acquaintance 


5A Electrical 

5B Mechanical 12. Observation of persons of similar 
6. Medicine appearance 

6A General practice 

6B Surgeon 13. General impression 
7. Agriculture 

7A Farming 14. Hunch 

7B Forestry 

7C Ranching 15. Snap Judgment 

7D Country gentleman 
8. Gentlemen of leisure 16. Unclassified 


SUMMARY OF FINDINGS 


1. The experimental evidence obtained in this investiga- 
tion is in contradiction to the claims of physiognomists, and to 
the popular belief of personnel workers, that vocational apti- 
tude and success can be judged from photographs. 

2. The prediction and judgment of vocational aptitude and 
of success or failure from photographs by personnel workers 
are no better than chance. 

3. The judgment and prediction of experienced personnel 
workers are no more accurate than those of untrained ob- 
servers (college undergraduates). 

4. In spite of the fact that the accuracy of prediction 
and of judgment of personnel workers is no better than that 
of college students, personnel workers are more certain of the 
accuracy of their judgments. 

5. Personnel workers seem to have a bias toward success. 
A higher percentage of ‘success’ judgments reveals a degree 
of optimism with respect to vocational success greater than 
that of college students. 

6. The judgments of personnel workers, as the judgments 
of college students, do not seem to be based upon any single 
system of character analysis. They depend largely upon 
general impression, facial expression, and resemblance to an 
acquaintance. 

The findings upon which these conclusions are based are 


embodied in the accompanying tables and charts. 
41 
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ANALYSIS OF FINDINGS 


(1) The Ability of Personnel Workers to Judge Vocational 
Success from Photographs of ‘Older’ Men 


Table I and Chart III present evidence on whether 
personnel workers can judge vocational success from the ex- 
amination of photographs of men in professional work who 
have been out of the University for twenty-five years. 

The twenty-four personnel workers recorded a total of 
475 judgments in the case of the ‘most successful’ older group, 
and a total of 478 judgments with respect to the vocational 
success of the ‘least successful’ or ‘failure’ group. The table 
shows that 82.7 percent of the judgments of ‘successful’ men 
are ‘success’ or correct judgments. The percentage of wrong 
or ‘failure’ judgments in the case of the older ‘successful’ 
group amounts to only 17.3 percent. On the other hand 
76.8 percent of the judgments of the ‘least successful’ or 
‘failure’ group are also ‘success’ judgments. In other 
words, only 23.2 percent of the judgments concerning the 
degree of success of the ‘least successful’ or ‘failure’ group 
are correct. 

If we combine the judgments from which these percentages 
are derived it is found that in the case of the personnel workers 
52.8 percent of the judgments are correct (Chart III). In 
other words, the percentage of correct judgments is very 
slightly better than chance. Ifthe members of this group had 
tossed coins instead of attempting to make a deliberate judg- 
ment on the relative success of the men whose photographs 
were shown, the results would have been just as accurate as 
those obtained through deliberate judgment. 

If the photographs of the ‘older’ group are considered 
individually the invalidity of judgments on ‘success’ or ‘fail- 
ure’ is again evidenced. Of the 20 ‘older’ men who were 
actually ‘successful’ 19 were judged ‘successful’ by a large 
majority of the observers. Of the 20 who were actually ‘least 
successful’ or failures, 18 were judged ‘successful’ by the 
majority of personnel workers. It is interesting to note that 
in the case of four photographs of the ‘older’ group there is 
unanimous agreement with respect to degree of success. All 
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observers unite in making a ‘success’ judgment in the case of 
512-0 who actually belongs in the ‘successful’ group. At the 
same time all observers unanimously place F22-0, F25-0 and 
F28-o in the ‘most successful’ group, although these three 
actually belong in the ‘least successful’ group. In other 
words, in cases of individual photographs where unanimous 
agreement exists among the observers, the observers unani- 
mously agree on wrong judgments in three cases out of four. 


CHART N®* TIT 


PERCENTAGE OF CORRECT JUDGMENTS 
ON “SUCCESS ANDO “FAILURE™ 













































































“OLDeR"GrouP “YOUNGER “GRouP YOUNGER’ GROUPS 
COMBINED 
828 
- vd 22 ons S25 ses 
GY _ 
, YS / 4) y, 
= 30 Wh D, Yi} 
a 4 
ri 20 Y Ui by 
Gy Vi 
Y J GF 
WY @D YW} 
[| PERSONNEL WORKERS COLLEGE STUOENTS 


(2) The Ability of Personnel Workers to Predict Vocational 
‘Success’ from Photographs of ‘ Younger’ Men 


In Table II and Chart III are embodied results of the 
section of the experiment dealing with the prediction of 
vocational ‘success’ or failure from an examination of the 
photographs of the ‘younger’ men. 

The twenty-four personnel workers recorded a total of 
472 judgments in the case of the ‘most successful’ younger 
group, and a total of 473 judgments with respect to the voca- 
tional ‘success’ of the ‘least successful’ or ‘failure’ group. 
Table II shows that 80.5 percent of the predictions in the case 
of the ‘successful’ younger men are ‘success’ or correct predic- 
tions. The percentage of wrong or ‘failure’ predictions in the 
case of the ‘younger’ ‘most successful’ group amounts to only 
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19.5 percent. On the other hand, 75.9 percent of the predic- 
tions of the ‘least successful’ or ‘failure’ group arealso ‘success’ 
predictions. If figures from which these percentages are de- 
rived arecombined, it is found that §2.2 percent of the predictions 
are correct. (Chart III.) In other words, the percentage of 
correct predictions on ‘success’ and ‘failure’ in the case of the 
‘younger’ men is no better than chance. 

An examination of the judgments on individual photo- 
graphs of the ‘younger’ group in Table II shows again a 
somewhat uniform agreement with respect to classification of 
the photographs. However, in the ‘younger’ group there is 
no photograph in the case of which the personnel workers 
agree unanimously with respect to the classification on degree 
of ‘success.’ 


(3) Ability of Personnel Workers to Predict and Judge 
Vocational Aptitude 


A review of the judgments on the actual vocations of the 
‘older’ men and of the recommendations with respect to voca- 
tions to be followed by the ‘younger’ men shows little agree- 
ment among the personnel workers. There is a wide range 
in the vocations assigned to each photograph. In the case 
of only twelve photographs do five or more judges agree on the 
vocation to be assigned to a man whose photograph is shown. 
In Table III are shown the photographs in the case of which 
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such agreement occurs. It is significant to note that in these 
twelve cases of comparative agreement there is only one, 
F36-0, in which the correct occupation is assigned. Eight 
personnel workers agree that S1o-0 is aclergyman. Actually 
he is a successful engineer. ‘Twelve personnel workers agree 
that S2-Y, a successful educator, is a clergyman. Similar 
discrepancies are found in the remaining cases. 


(4) The Feeling of Certainty of Personnel Workers 


The degree of certainty on the part of personnel workers 
in making judgments on success and failure is shown in Chart 
IV. In view of the fact that the judgments made by the 


CHART N* IV 
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personnel workers are actually no better than chance, an aver- 
age feeling of certainty of 53.7 percent from all photographs 
can be described as extremely high. It is interesting to note 
that the range of average feeling of certainty for individual 
photographs when judged by the personnel group is from 40 
percent to 73 percent. There is no significant difference in 
the feeling of certainty in predicting success and failure for the 
‘younger’ group as compared with the feeling of certainty 
when passing judgment on the ‘older’ group. 
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(5) The Physical Bases of Prediction and Judgment of 


Vocational Aptitude and Success and Failure 


In Chart V is shown the frequency with which each basis 
of judgment is employed by personnel workers in recording 
their predictions and judgments on the success and failure and 
the vocational aptitude of both the ‘older’ and ‘younger’ men. 
Personnel workers rely upon general impression and facial 
expression in the case of 53.2 percent of their judgments. An 
analysis of the data shows that they rely upon these two cri- 
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teria in the case of 61 percent of their judgments of ‘older’ men 
and in the case of 44 percent of their predictions regarding 
‘younger’ men. It is significant to note that in few instances 
are the specific criteria of character analysis systems employed 
in arriving at a decision of success and failure. This is par- 
ticularly significant in view of the acquaintance of this group 
of personnel workers with the most frequently employed sys- 














PREDICTION OF VOCATIONAL APTITUDE 627 


tems of character analysis, and of the confidence of a number 
of them in their validity. 


(6) Comparison of Personnel Workers (Trained Observers) and 
College Students (Untrained Observers) 


In all of the tables and charts included with this report, 
the results of personnel workers are compared with those of 
untrained observers (college students) employed in the experi- 
ment by Landis and Phelps. In the tables, results of the 
college under-graduates used as subjects by Landis and Phelps 
are shown in italics. The cross hatched columns in the charts 
also show the results of the college undergraduates.°® 

An examination of Chart III shows that 52.8 percent of the 
judgments of personnel workers with respect to ‘success’ and 
‘failure’ of the ‘older’ group are correct as compared with 
51.3 percent ® in the case of college students. 

With respect to ‘success’ and ‘failure’ of the ‘younger’ 
group 52.2 percent of the judgments of personnel workers are 
correct as compared with 47.3 percent of the judgments of 
college students. In other words, the judgment of personnel 
workers with respect to ‘success’ and ‘failure’ of men whose 
photographs were shown, is very slightly better than that of 
college undergraduates, and like that of college students, is 
close to ‘chance.’ 

An examination of Tables I, II] and Chart III shows a 
rather interesting difference with respect to the tendency of 
judgment. Although there is no significant difference in the 
percentage of correct judgments between personnel workers 
and college students, personnel workers show a marked ten- 
dency to judge the photographs as ‘successful.’ Thus, in the 
case of the ‘older’ group 82.7 percent of the judgments were 


5 The figures for college students in Tables I, II, and Chart III are taken from 
a control experiment by Landis and Phelps, involving 93 students, in which directions 
had been changed so as to counterbalance a bias toward ‘success’.judgments which 
appeared in an earlier investigation. 

The figures for college students in the remaining tables and charts are cited from 
the original experiment by Landis and Phelps, involving 20 subjects. 

6 This figure was obtained by averaging the average percent ‘S’ judgments and 
the average percent ‘F’ judgments given by Landis and Phelps. Inasmuch as the 
number of cases remains constant in the figures cited by Landis and Phelps, the 
figures on percent correct obtained in this way and shown on Chart III are valid. 











628 MORRIS S. VITELES AND KINSLEY R. SMITH 


‘success’ judgments. At the same time 76.8 percent of the 
judgments with respect to failures were ‘success’ judgments. 
The same tendency is revealed in the case of the ‘younger’ 
group, the judgments of ‘success’ being 80.5 percent in the 
case of the ‘successful’ group and the prediction of ‘success’ 
75.9 percent in the case of the ‘least successful’ or ‘failure’ 
group. The undergraduates, on the other hand, furnish ap- 
proximately the same proportion of ‘success’ and ‘failure’ 
judgments in the case of ‘successful’ and ‘failure’ groups both 
for ‘older’ and ‘younger’ men.’ 

When the trained and untrained observers are compared 
with respect to their ability to judge vocational aptitude, or 
what vocation a man has followed or will follow for twenty- 
five years, the personnel workers exhibit no more ability 
than do the college students. Landis and Phelps, as the au- 
thors of the present study, found a wide range of vocations 
assigned to each photograph. It seems that the personnel 
workers agree less among themselves in determining the 
vocations than do the college students, for, in spite of the fact 
that there were twenty-four personnel judges and only twenty 
student observers, the results of the former reveal only twelve 
photographs in the case of which the same vocation is assigned 
by five or more judges, while the results of the latter show 
fourteen such subjects. However, the inaccuracy in choice 
of vocation is about as great in both groups. 

Landis and Phelps found the average feeling of certainty of 
college undergraduates in making ratings on ‘success’ and‘ fail- 
ure’ to be 32 percent when the ‘older’ and ‘younger’ groups are 
combined. (ChartIV.) Forthe combined groups the average 
feeling of certainty of the personnel worker is 53.7 percent, 
or approximately 1.6 times that of the college undergrad- 


7 Landis and Phelps found a high degree of optimism in the judgments of the 
photographs on the part of college undergraduates in their original experiment. The 
authors believed this to be due to the wording of the instructions. In a control 
experiment with revised instructions this tendency disappeared. The directions used 
in the present experiment were derived from the control experiment by Landis and 
Phelps. In spite of this fact, the percentage of ‘success’ judgments and predictions 
of personnel workers are actually in excess of those in the original experiment by 
Landis and Phelps, in which the instructions created a bias in the direction of ‘success” 
judgments. 
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uates. The chart shows that college undergraduates tend to 
be more certain of their judgments of ‘older’ men than in 
predicting ‘success’ and ‘failure’ of ‘younger’ men. This 
tendency is also present in the case of personnel workers. 

In Chart V is shown a comparison of the basis of the judg- 
ments employed by undergraduates and personnel workers. 
One of the most marked differences between personnel 
workers and college students is in general impression. The 
former depend upon general impression in the case of 36.4 per 
cent of the judgments, whereas college undergraduates depend 
upon this in the case of only 23.7 percent of their judgments. 
Another marked difference is to be found in similarity to 
some acquaintance. College undergraduates employ this 
criterion about four times as often as do personnel workers. 
Personnel workers do not tend to use the strictly physiognomic 


criteria of character analysis systems more frequently than do 
college undergraduates. 


GENERAL CONCLUSIONS 


In summarizing the results of their experiment, Landis and 
Phelps point out that their study “lends absolutely no evi- 
dence toward the confirmation of the claims of the physiog- 
nomists. It offers no evidence which would indicate that the 
inclusion of a photograph in a letter of application or on a 
personnel sheet is of any positive value.” This earlier experi- 
ment was open to the criticism that its subjects were un- 
trained college undergraduates whose experience in judging 
men was too restricted to permit an adequate judgment, 
regardless of the criterion employed. ‘The present experiment 
shows that personnel workers, in spite of their experience in 
handling men, can use the photograph to no better advantage 
than can college students. These findings are of particular 
significance in view of the very widespread use of photographs, 
not primarily in industrial plants, but in selection by college 
authorities and in placement by employment agencies, of men 


and women in teaching, research work, and in other forms of 
professional work. 


(Manuscript received October 1, 1931) 











THE FUNCTIONAL SIGNIFICANCE OF IMAGERY 
DIFFERENCES ! 


BY FRANK C. DAVIS 
University of California at Los Angeles 


In 1914 Watson (29) ‘dismissed’ the image from psychol- 
ogy, justifying his stand by pointing to an alleged lack of 
correlation between the verbal reports of Fernald’s (8) subjects 
and their objective performances in the ‘imagery’ tests de- 
vised by her. I have elsewhere (4) called attention to the fact 
that, had Watson been interested in making out a case for 
rather than against the image, he could have found numer- 
ous passages in Fernald’s monograph indicative of the presence 
of correlation between subjects’ reports and test scores, rather 
than the absence of correlation which he reported. Quite 
aside from this question of fact, however, it is clear that 
sweeping generalizations with respect to the objective signifi- 
cance of differences in imagery reports should not be based 
on results from a very small number of highly selected sub- 
jects. (Fernald’s group comprised from five to eleven sub- 
jects, all of them graduate students or instructors in psy- 
chology.) The present investigation was planned to include 
a sufficiently large number of subjects to make conclusions 
therefrom of some statistical significance. 


STATEMENT OF THE PROBLEM 


The purpose of the present investigation may be stated as 
follows: To determine to what extent differences in subjects’ 
reports as to their use of imagery in the performance of given 
tests are related to objective differences in test scores. 


1The present report represents an abridgment of a thesis submitted in partial 
fulfillment of the requirements for the Ph.D. degree at the University of California. 
The investigation here reported was begun in 1924; it was discontinued when I left 
the university to take up other work, and was resumed when I returned to the university 
in 1928. I wish to acknowledge my indebtedness to Dr. Warner Brown, under whose 
direction this experiment was carried out, and to Dr. E. C. Tolman, who first stimulated 
my interest in the question of the objective significance of differences in imagery 
reports. | 
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SUBJECTS 


The subjects were the students in the class in Elementary 
General Psychology. The majority of them were sophomores 
or juniors; a few were seniors or graduate students. The tests 
decided upon were administered as group tests to all the 
laboratory sections during the week when ‘Imagery’ was the 
topic under discussion in the class lectures. The number of 
subjects per section varied between fifteen and thirty. Each 
section gave fifty minutes to the experiment. 


MetTuHOoD OF REportT USED By SUBJECTS, AND PRELIMINARY 
PROCEDURE 


Before the administration of the tests was begun, the first 
ten minutes at each section meeting was given over to a brief 
recapitulation of the topic of imagery. The Galton ‘breakfast 
table’ type of procedure was gone through and the various 
possible modalities of images discussed. The names of the 
latter were written on the board and the subjects were told 
to use the first letters of each as abbreviations in their subse- 
quent verbal reports. 

Sometimes this recapitulation and discussion could not be 
arbitrarily terminated, promptly at the end of the ten minutes 
allotted, without an apparent sacrifice of rapport in the inter- 
ests of a rigid time schedule. Since the establishment and 
maintenance of as high a degree of rapport as possible between 
subjects and experimenter was deemed of first importance, 
alternate tests were provided for use when the time became too 
short for the completion of all of the regular series of tests. 

The subjects were always shown sample tests, and were 
instructed that at the end of each test they would be asked to 
write down the letter or letters representing the processes 
which they had used in recall or in recognition, as the case 
might be. If doubtful as to what imagery, if any, had been 
present they were told to put down a ‘?’. If sure that no 
imagery had played a part in their performance they were 
instructed to write ‘none’ at the end of the test or tests of 
which that seemed to them to be true. Where they were 
aware of more than one imaginal process in their performance 
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of a given task they were asked to underline the process or 
processes which seemed to them to be dominant, or if none of 
them seemed more dominant than the others to leave the 
letters un-underlined. (This was done only with the 1929 
subjects. Many of the 1928 subjects did underline one or 
more processes, though uninstructed to do so. The distinc- 
tion, seemingly, is difficult to make only in the case of those 
subjects whose imagery tendencies are ‘balanced,’ in the 
sense in which O’Brien (18) used that term. Most ‘mixed’ 
subjects report little hesitation concerning which process is for 
them dominant over the others.) 

No attempt was made to differentiate between kinesthetic 
imagery and kinesthetic sensations—the subjects were told 
to put down ‘K’ if they were aware of movements, or images of 
movements, from lips, tongue, or larynx, or from other 
regions of the body, as playing a part in their recall of the 
material presented, or in the associations which came to them 
in the case of the ‘Rhyming’ and ‘Childhood’ association 
tests. (The tests used, and the manner of their selection, will 
be described in following sections.) 

The tests proper were not begun until all questions had 
been answered, and until each subject apparently understood 
fully the various possible kinds of report. Then at the con- 
clusion of the first test—that for Tonal Memory (see descrip- 
tion following)—the experimenter always said: “‘ Now state as 
accurately as possible how, in your opinion, you performed this 
test. (Slight pause.) Did you form a visual schema or pat- 
tern of the notes as you heard them the first time, and again 
the second time, and compare the second pattern with the 
first? Some people report that they do this sort of test that 
way. (Slight pause.) Or were you aware of movements, 
perhaps very slight movements, from your finger, or hand, or 
head, or eyes, or vocal cords, or from some other part of your 
body as the notes were played, and did you make your com- 
parison of the two series of notes on the basis of those move- 
ments? Some people report that they do this sort of test in 
that way. (Slight pause.) Or did you concentrate on the 
sound of the notes in the first series, and then carry them in 
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your mind while you compared them with the notes of the 
second series? Some say they do it that way. Put down 
‘V’ if a visual pattern was your method, ‘K’ if you were aware 
of movements and made your judgment on the basis of them, 
and ‘A’ if you made the comparison in terms of the sounds. 
If you used a combination of these or other processes put 
down the letters which stand for the processes you used. If 
one of them seemed to you to be dominant over the others, 
underline it. If you are sure none of these entered into your 
performance of the test put down ‘none.’ If you are uncer- 
tain, put down a question mark.” ? 

At the end of each of the other tests the experimenter 
always said simply: ‘‘ Now tell as accurately as you can how 
you did it.” 

SELECTION OF TESTS 


In the matter of test selection I had the benefit of Fernald’s 
extensive researches in this field. I made my initial selection 
largely from among her ‘tests for more accurate analysis of 
type.’ I eliminated some of these on the basis of my own 
experience with them, modified others for group administra- 
tion, and added others of my devising. Among the tests 
which were eliminated were a ‘spelling backwards’ test, the 


2 Whatever element of suggestion entered into these remarks at the end of the 
Tonal Memory Test, I tried to weight in favor of reports of visual schema or kinzsthesis 
as against auditory imagery, by ‘suggesting’ the two former possibilities first. The 
extent to which subjects like these resist such suggestions, and the extent to which 
most of their reports are based on experiences the character of which seem to them 
rather unambiguous, was evident in their attitudes as they listened. A few sporadic 
nods of the head registered understanding and acquiescence in response to the ‘visual 
pattern’ and ‘kinesthesis’ suggestions. The majority, however, merely waited, or 
registered downright negativism; which changed to affirmative head noddings as soon 
as the ‘auditory memory’ possibility was advanced. The suggestion that the fact 
that the stimuli were auditorily presented might account for the preponderance of 
auditory imagery reported as dominant in the case of this test may be countered by 
pointing to the results of the ‘Rhyming’ test, where although the stimuli were visually 
presented the reports were nevertheless preponderantly of auditory images as dominant, 
and by emphasizing once more the conclusions reached by Segal (23), Fernald (8), 
Feuchtwanger (9), O’Brien (18) and others, that dominant imagery tendencies are 
for many subjects more important than the physical characteristics of the stimulus 
in determining how the latter will be attended to. Note, in further corroboration of 
this, how in Table 2 the subjects’ reports spread through various modalities and 


combinations of modalities no matter what the mode of presentation of the stimulus 
material. 
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reproduction of heard literary passages, the solving of physics 
problems and problems like the ‘colored cube,’ the ‘straight- 
ahead’ spelling of difficult English words, auditory span for 
numbers and letters, both forward and reversed, a ‘title of 
book—name of author’ association test, drawing from memory 
of a well-known face, free-hand drawing of the ‘ most-pleasing- 
rectangle,’ ink-blot test for number and kind of associations. 
Of the above, I consider the spelling backwards test, the 
reproduction of heard literary passages, auditory span for 
numbers and letters, and perhaps the ink-blot test, promising 
tests for the diagnosis of imagery differences when used in the 
testing of individuals singly rather than in groups. The spell- 
ing backwards test I attempted to modify for use as a group 
test by making it a ‘time-limit’ test and scoring for number of 
errors and omissions. The ‘time-limit’ is difficult to enforce, 
however, since many individuals find themselves so stimulated 
by the novelty and the difficulties of such a task that they 
cannot, at command, cease their endeavors and turn at once 
to something else. The reproduction of heard literary pas- 
sages took too much time if enough passages were used to 
furnish a measure of the test’s reliability. ‘The same was true 
of the ink-blot test. The solving of physics problems, in 
addition to consuming too much time, offered the further 
dificulty that many college students seem to find even the 
simplest problems in physics beyond them. ‘Difficult Eng- 
lish words,’ for straight-ahead spelling, has certain drawbacks 
as a differential test of imagery tendencies, chiefly, it would 
seem from a priori considerations, on the score of unequal 
training, since methods of teaching spelling differ and since 
teachers themselves differ in their ability to ‘put over’ what- 
ever methods are taught. However, when training may be 
considered to have been fairly constant there still is found 
great unevenness in spelling ability. It was this observation 
that impelled me to consider such a test for inclusion as a pos- 
sible imagery test, since it seemed conceivable that clear visual 
images in the case of both phonetic and nonphonetic spellings, 


3 De Voss (6) reports that of all the abilities measured by the Stanford Achieve- 
ment Test battery, spelling is apparently most nearly ‘independent’ of the rest. 
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or clear auditory images in the case of phonetic spellings, 
might turn out to be important components of the ‘spelling 
ability complex.’ My early try-out of the test unfortunately 
uncovered a rather large proportion of subjects who failed so 
consistently on the words offered them that there seemed to 
be some danger of the disaffection engendered by their failure 
carrying over to the rest of the tests, so this was crossed off the 
list. But I feel that the question of a possible relationship 
between imagery tendencies and spelling ability does merit 
further investigation. 


The elimination of the other tests which were tried was in 
every case on one of the following grounds: (1) scoring difficul- 
ties; (2) length of test necessary to assure anything like satis- 
factory reliability; (3) improbability, on the basis of a study 
of results from ‘try-out’ sections, that the test would show 
any relation to differences in imagery tendencies. 


DESCRIPTION OF TESTS SELECTED 


Results will be reported from the following tests: (1) The Seashore Tonal Memory 
test; (2) Memory for Simple Geometrical Figures; (3) Memory for Roman and Arabic 
Numerals; (4) Memory for Arabic Numerals; (§) Memory for Words Alike in Sound 
but not in Spelling; (6) Memory for Words Spelled Alike but Pronounced Differently; 
(7) Test of Rhyming Ability; (8) ‘Childhood’ Free Association Test.‘ 

(1) Tonal Memory.—The Seashore test for tonal memory seems to offer an 
example of a test in which imagery may function in recognition. It forms one of the 
series of tests called by Seashore (21) ‘Tests of Musical Ability.” These tests come 
in the form of phonograph records, one test to a record. Some of the tests have been 
reported as having low reliability, and their validity as tests diagnostic of ‘musical 
ability’ is in dispute. The ‘Tonal Memory’ test, however, yielded with the subjects 
of the present experiment a reliability coefficient of .go, and it has recently been 
reported by McCarthy (15) as giving a higher validity coefficient than any of the 
other Seashore tests. While I am not here concerned with the validity of the test as - 
a measure of ‘musical ability’ but rather with its ability to discriminate individuals 
reporting different sorts of processes in their performance of it, nevertheless if it is 
to any degree a valid test of musical ability and if at the same time success in it should 
prove to be correlated with imagery reports of a given sort, then the question as to 
the effect of ‘training’ in imagery, upon musical ability, would evidently be in order. 

In this test a series of tones varying in pitch is presented to the subject; then 
after a brief pause the same series, with the exception that the pitch of one of the tones 
is slightly raised or lowered, is repeated, and the subject is required to enter on a 
blank ruled for the purpose the number of the tone which was altered—whether, for 
example, when the series of tones presented numbered six, it was the first, second, 
third, fourth, fifth, or sixth which was different the second time from the first. The 





* In addition to these, Army Alpha scores were available for correlational purposes. 
42 
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total number of judgments to be made is fifty; the series of tones to be compared 
range in length from two to six, and ten judgments are made of each. The score 
used was simply the total number of right judgments. 

(2) Simple Geometrical Figures —In devising tests which involve the perception 
and reproduction of geometrical forms experimenters need to take note of the fact 
that the ‘same,’ from the purely objective standpoint, stimulus-pattern may be 
perceived quite differently by different subjects.5 Where it is desired to place this 
variable as nearly as possible under control (where, for example, as in the present 
experiment, it is desired to make the test primarily a test of visual, or kinesthetic, 
rather than of verbal memory) resort is had to a procedure akin to the nonsense- 
syllable technique of Ebbinghaus, and the visual patterns to be presented are made 
as nearly as possible ‘meaningless.’ In the present case, a large number of geometrical 
figures were shown, one after another, to the individuals comprising some of the 
‘try-out’ sections mentioned previously, and these individuals were asked to write 
down all the verbal associations which each pattern ‘set off.’ The patterns to which 
it proved easiest to give verbal tags which seemed to aid materially in the subsequent 
reproductions were discarded, and those hardest to ‘conceptualize’ accurately during 
the short exposure periods were used in the later experiments. These selected patterns 
could conceivably be studied, and retained in memory, either visually or kinesthetically 
or both in combination, but in the case of most of them, and for most if not all of our 
subjects, no simple verbal tag sufficed to assure accurate reproduction. The twelve 
figures chosen (see appendix) were shown to the subjects two at a time, and reproduction 
was called for immediately. The exposure time was five seconds for each pair of 
figures. Scoring was in terms of accuracy of reproduction. Each figure consisted of 
a curved line joined to two straight lines; if the reproduction was perfect (that is, if 
the position of the curved line with reference to the two straight lines, and the direction 
of all three lines, was like the original) it was scored ‘3,’ if two thirds of it was repro- 
duced correctly but the other segment was absent or wrong in direction the score 
given was ‘2,’ if one segment was right but the other two in the wrong direction or 
omitted, the score was ‘1.’ If a given pattern was all wrong, or omitted completely, 
‘o’ was recorded. Sample figures were shown, and the method of scoring was fully 
explained, before the test was given. 

(3) Roman and Arabic Numerals.—This forms one of the group of tests labelled 
by Fernald ‘tests for more accurate analysis of type.’ Roman and Arabic numerals, 
in mixed order and arranged in Binet letter-square form, are presented to the subjects 
for a period of ten seconds. The subjects are instructed to reproduce them as accu- 
rately as possible. Scoring is in terms of number, and accuracy of placement, of 





§ Various investigators (Katz (13), Wulf (30), Messmer (16), Gibson (10), Allport 
(1), ¢.g.) have established this fact, which has led some of them to posit ‘types’ of 
observers—for example, ‘central’ and ‘peripheral’ types (Katz), ‘comprehensive’ and 
‘isolative’ types (Wulf), ‘subjective’ and ‘objective’ types (Messmer). Whatever 
the merits of such attempts to classify subjects in rigid ‘type’ categories, the fact of 
individual differences in the perception of a given objective situation I believe is 
incontestable. O’Brien (18) reports that ‘attentive selection,’ exercised by his subjects 
‘among the various presentative aspects of a material,’ was an important variable 
and one difficult to place under control. Fernald’s subjects showed striking individual 
differences in tendencies toward ‘naming,’ while studying pictures and other visually- 
presented materials. Gibson reports that one pattern, ¢.g., was variously ‘thought of’ 
by different ones among his subjects as ‘two triangles,’ as ‘letter W,’ and as ‘mountains.’ 
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numerals reproduced. An item reproduced correctly both as to form and placement 
was scored ‘3’ if correct in form but misplaced one position vertically or horizontally 
it scored ‘2,’ if correct in form but misplaced more than one position a score of ‘1’ 
was given, and ‘0’ was scored for any item wrong in form or omitted altogether. 
This is a departure from the form of scoring used by Fernald; it seemed to me to 
distribute the subjects better and to be better adapted to correlational treatment of 
the results. 


(4) Arabic Numerals.—Like the above, except that Arabic numerals only were 
used. 

(5) Words Alike in Sound but not in Spelling.—This also formed one of Fernald’s 
‘tests for more accurate analysis,’ and like the Roman and Arabic Numerals test this 
one was intended by Fernald to favor those with strong visual imagery and to handicap 
those weak or lacking in visual imagery. As in the last two tests the test items are 
arranged in letter-square form, with ten seconds for study of the visually-presented 
material. Scoring is the same as above. 

(6) Words Spelled Alike but Pronounced Differently.—Same as above, except that 
in this test pairs of words with identical spelling but different pronunciations are used, 
and the words are read to the subject at the same time that they are presented visually. 
Time of presentation the same. Intended by Fernald to favor those subjects able to 
utilize auditory imagery in recall. In the middle position, in each ‘form’ of this test, 
is one ‘odd’ word, unlike in either appearance or sound any of the others. 

(7) Rhyming Ability —lIn the beginning the same list of stimulus words was used 
which Fernald had used with her subjects. The ‘number of rhymes given per unit 
of time,’ did not prove to be diagnostic of imagery differences, but a study of the 
‘errors’ made by the subjects to whom this group of stimulus words was given revealed 
the fact that most of the subjects making them reported themselves lacking in auditory 
imagery in the performance of this test. The number of errors made was not large, 
however, and the test’s reliability was low, so an attempt was made to modify the 
procedure with a view to making the test more difficult and thus distributing the 
subjects more widely in scores made. A different series of stimulus words was chosen 
(the original series was ‘speak,’ ‘low,’ ‘case,’ ‘home’; the new series was ‘lose,’ ‘cost,’ 
‘loose,’ ‘boss’) which it was believed might necessitate somewhat finer discriminatory 
capacity, and instead of reading the stimulus-word aloud to the subjects it was now 
written on the blackboard and left in sight during the minute allowed to find rhymes 
for it. The instructions laid stress on the fact that the words put down must be good 
rhymes—that they must sound like the stimulus-word. No restrictions were placed 
on the number of syllables in the words put down, and it was stated that any English 
words were permissible, including slang, colloquialisms, and proper names, but that 
words from foreign languages were barred. Various scoring techniques were tried 
out in connection with this test, and these will be discussed in connection with the 
presentation of the test results. 

(8) ‘Childhood’ Free Association Test.—This test was one of the ones used 
when time did not permit carrying out the regular test schedule. It was given to 
one section for one half minute, to two other sections for one minute each, and to 
another for two minutes. The results seemed sufficiently interesting and significant 
to warrant their being reported. The sections who took this test were simply told 
that the experimenter would pronounce a stimulus-word, and that they were then 
to let their minds roam freely and write down every word they thought of until they 
were told to stop. The word ‘childhood’ was the stimulus word. 
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EXPERIMENTAL RESULTS 


Under this heading will be presented, first, the results of 
experimentation considered with reference to the relation 
between subjects’ reports and test scores. Second, test 
intercorrelations will be presented, for consideration in con- 
nection with the question as to the degree of specificity of the 
abilities measured by the various tests. 


I. Relation Between Test Scores and Subjects’ Reports 


1. Tonal Memory.—Reports of the subjects as to their 
manner of performance of this test were grouped into two 
categories, according as they reported the presence or absence 
of auditory imagery. ‘The ‘auditory’ group, in this treatment 
of the results, thus is made up of all those who reported either 
‘A’ alone, or ‘AK’ or ‘AV’ or ‘AVK.” The other group com- 
prises all those who reported either ‘K’ or ‘V’ or ‘VK.’ 
Referring to Table 1, the median score of the ‘auditory’ 
group is seen to be 38.67. The median score of the ‘non- 
auditory’ group is 29.5. Approximately nine out of ten of 
the ‘auditory’ group reach or exceed the median score of the 
‘non-auditory’ group. The difference between the mean 
scores of the two groups is nine times its probable error, in- 
dicating that the difference in favor of the ‘auditory’ group 
represents a genuine difference in the ability or abilities which 
are measured by this test, rather than a difference which might 
occur by the operation of ‘chance’ factors alone. 


TABLE 1 


CoMPARISON OF DIFFERENT IMAGERY Groups IN ScoRES RECEIVED IN TONAL 
Memory AND GEOMETRICAL Ficures TESTS 








Percent Reliability of 





Crit. ne Test 
: — Me- | y : . | or Passing 
Name of Test Dominant Imagery diem Mean| S.D a Med. of 


Other 144 | Whole 
Group vs. 4% | Test 





Tonal Memory | Auditory (n= 133) .| 38.67] 37.98] 7-68] 9.09] 87.2 817 | .90 
Non-Aud. (m=35) .| 29.50] 29.17] 7.54 11.4 . 





Geometrical Visual (n=96).....| 29.04] 28.27] 3.93} 6.65} 81.3 654. | .79 
Figures Non-Visual (n= 39) | 25.00] 24.48] 4.64 20.5 
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2. Test for Memory of Geometrical Figures.—The subjects 
were grouped into two classes, as in the test for Tonal Memory, 
but this time the basis for division was the presence or ab- 
sence of a ‘visual imagery dominant’ report. Thus the ‘vis- 
ual’ group includes all those who reported either visual 
imagery alone, or those who reported visual as ‘dominant’ 
over whatever else they reported along with it. The ‘non- 
visual dominant’ group includes all those subjects who re- 
ported either ‘K,’ ‘A,’ or ‘C’ (‘C’ here stands for reports like 
‘gave them names,’ ‘thought of objects,’ ‘tried to associate 
them with objects,’ and the like), either alone or as dominant 
over ‘V,’ and also the subjects who reported ‘None’ or ‘?.’ 
Table 1 shows how the two groups compare with each other 
in their performance of this test. The critical ratio, 6.65, 
means that the chances are negligible that the difference 
found between these two groups, in favor of the ‘visual dom- 
inant’ group, could have occurred by chance alone. Eight 
out of ten of the ‘visual dominant’ group reach or exceed 
the median score of the ‘non-visual dominant’ group. 

The conditions under which this test was administered 
probably weight it in favor of the ‘visual dominant’ group, 
and if the conditions were changed in any one or more of 
several ways the advantage in their favor shown by the above 
figures would quite probably be wiped out, or even reversed 
in favor of dominantly ‘kinesthetic’ or ‘conceptual’ perform- 
ance. ‘That is to say, a five-second presentation of a pair of 
patterns of the sort used in this test apparently favors those 
who can ‘get’ and carry in memory a visual image of the 
patterns shown, as against those who attempt to utilize the 
five-second exposure period either for ‘kinesthetic fixation’ 
of the figures or for attempted verbal descriptions of them. 
But if the patterns shown were more complicated and if 
scoring were in terms of number of repetitions necessary for 
perfect reproduction, verbalization and kinzsthesis (except 
for ‘eidetic’ individuals, reputedly rare among adults) would 
perhaps be reported ‘dominant’ more frequently than ‘visual 
memory’ would be, and might result in better scores. The 
influence of seemingly slight changes in experimental condi- 
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tions upon the reports of subjects who are versatile enough 
in their command of different imagery modalities to warrant 
their classification as ‘mixed’ or ‘balanced’ with respect to 
dominant imagery tendencies has been observed by man ex- 
perimenters. 

The objective data, then, in the case of this test as well as 
the preceding one, shows reports of given imagery processes 
(which a priort analysis would point to as aiding in or essential 
to the performance of the tests) to be correlated with success, 
and shows reports of the absence of these processes to be 
associated with relatively poor test performance in those 
subjects so reporting. 

3. Tables 2 and 3 present data from three of the tests 
chosen from among the ones called by Fernald ‘tests for more 
accurate analysis of type,’ and from one other similar test, 
‘Memory for Arabic Numerals.’ 


TABLE 3 


Ratio OF THE MEAN Score oF THE “‘No V or A” Group To THE AVERAGE OF THE 
MEAN Scores oF ALL THE OTHER Groups 























. . Words Alike in Sound | Words Spelt Alike 
Arat nan ¢ Arabic eS ae at piigies y 
Arabic Roman and Arabic not in Spelling |Pronounced Differently 
: | ) 
Percent of Percent of Percent of Percent of 
Total Score Total Score Total Score Total Score 
Total | Obtainedfrom | Total | Obtainedfrom | Total | Obtained from | Total | Obtained from 
Score Inaccurate Score Inaccurate Score Inaccurate | Score Inaccurate 
Placement of Placement of Placement of Placement of 
Correct Items Correct Items | Correct Items Correct Items 
' 
| 
88 1.82 .79 2.15 76 | .g2 | 65 1.19 














The figures listed in Table 2 in the columns headed ‘ Num- 
ber Times Reported’ show the extent to which subjects of this 
sort differ among themselves in their reports as to their mode 
of performance of a given test. The same columns also show 
the kind and extent of the changes in their reports, with shifts 
in the kind of material presented or in the manner of its pre- 
sentation. The first three tests—Memory for Arabic Num- 
erals, Memory for Roman and Arabic Numerals, and Memory 
for Words Alike in Sound, but not in Spelling—were all pre- 
sented visually. The fourth—Memory for Words Spelt Alike, 
but Pronounced Differently—combined visual and auditory 
modes of presentation. 
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Some factor, obviously enough, is operating to prevent 
these subjects all giving the same type of report for any given 
test, which would tend to be the result if their responses were 
determined primarily by the mode of presentation rather than 
by real differences in their manner of attending to the pre- 
sented material, or in their manner of recall, or both. It is 
apparent, however, that slight changes in the stimulus- 
material may for some subjects lead to changes in their reports 
(which accords with the findings of Segal (23), Sybel (27), 
Fernald (8), O’Brien (18), and others). Note that when the 
only change in the material presented is that from Roman and 
Arabic numerals, mixed, to all Arabic numerals, the mode of 
presentation being visual in each case, the number of ‘V’ 
responses drops from 70 to 61 and of ‘VK’ from 96 to 88, 
while the ‘A’ and ‘AK’ responses increase from 25 to 35 and 
from 43 to 55 respectively. Observe, too, that the ‘Number 
Times Reported’ column shows for all three of the tests involv- 
ing visual presentation alone a considerable number of re- 
ports (28 percent) which fail to include visual imagery, while 
in the case of the fourth test, with auditory stimulation added 
to the visual, there are eighty-one reports (20 percent) which 
make no mention of auditory imagery. It should be noted 
that the four ‘forms’ of a given test were never presented one 
after another. There were always three other tests between 
any two forms of the same test, so that these shifts in report— 
some kinds of reports tending to go with one test more than 
another, as in the case of the increase in ‘A’ reports in the 
‘Arabics’ test over the ‘Roman and Arabic’ test—represent 
genuine tendencies away from stereotypy in the responses 
given by these subjects. 

The same columns also show that the majority of the sub- 
jects in the present investigation, like Fernald’s more sophisti- 
cated group of subjects, would be classed as ‘mixed’ with re- 
spect to their imagery tendencies in test situations like the 
ones here reported.® 


6 In this respect these subjects contrast rather sharply with ‘specialized’ groups 
of artists, musicians, scientists, and writers to whom I gave similar tests in an earlier 
investigation (5). 
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The last test in Table 2 furnishes another point of similar- 
ity between Fernald’s sophisticated subjects and my own 
relatively naive group. Fernald states that this test called 
forth reports of auditory verbal imagery from all her subjects 
who ever reported such imagery, and the same holds for my 
group. In fact, all the shifts in reports, and combinations of 
reports, listed by Fernald as characterizing her group in 
connection with these tests are paralleled in the accounts 
given by my unsophisticated subjects; which fact I incline to 
interpret as lending further support, in addition to that 
afforded by the objective data, to the validity of the latter’s 
verbal reports. 

With reference to objective differences in test scores, be- 
tween groups giving different reports, perhaps the most 
significant fact to be noted in Table 2 has to do with the 
scores received by the ‘No V or A’ group. In these tests, 
which were planned to favor the use of either visual or auditory 
imagery, those who report the absence of both are consistently 
at the bottom, in mean total score, in all four tests. The 
consistency indicates that the difference is probably a reliable 
one, pointing to a real divergence in manner of performance, 
or in ‘ability’ of some sort, between this group and the others. 

In two of the tests (those calling for the perception and 
retention of the greater number of discrete impressions) the 
‘No V or A’ group gets a much lower mean ‘accuracy’ score 
than do the other groups. One possible explanation of this 
would be that these individuals, emphasizing vocal-motor 
processes, take slightly more time for the perception of each 
discrete stimulus than do those reporting ‘visual’ and ‘audi- 
tory’ alone or in combination with each other, or with ‘K.’ 
This would result in fewer repetitions being possible during the 
ten second ‘learning’ period, and accuracy might be lessened 
in consequence of the fewer repetitions. Another possibility 
is that a ‘perseveration’ factor goes with attention to auditory 
and visual qualities to a greater degree than is the case when 
attention is dominated by experiences of movement. 

The tests involving memory for words give interesting 
results in another respect. Those reporting neither visual 
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nor auditory images change their scores for ‘accuracy’ on 
these two ‘words’ tests relatively little with respect to their 
corresponding scores on the two ‘numerals’ tests, when com- 
pared with the performance of the other groups in this con- 
nection. Their mean total scores on the ‘words’ tests, how- 
ever, are not only somewhat lower, absolutely, than their 
own similar scores on the ‘numerals’ tests, but are also lower, 
relative to the mean total scores of the other groups. (Table 
3 shows the ratio between the mean score of the ‘No V or A’ 
group and the average of the mean scores of the other groups 
combined, with respect to both mean total score and the per- 
cent of the total score accruing from inaccurate placements of 
items correct in form, for all four of the tests here being dis- 
cussed.) The interpretation already suggested, that those 
reporting ‘No V or A’ are handicapped because of the methods 
which they employ in memorizing and in recall, seems to be 
reinforced by the results from these ‘words’ tests. ‘The other 
groups get approximately the same mean total scores in the 
‘words’ tests as in the ‘numerals’ tests, but a much larger 
part of these totals now is contributed by credit given for 
inaccurate placements of correct items. ‘The ‘perseveration’ 
factor, if real, works here for retention of more ‘type’ or ‘key’ 
words rather than for the exact location of many items. 
With a few words actually ‘remembered,’ and recorded, the 
blank spaces in the ‘letter-square’ can be filled in with words 
not directly retained as a result of the immediately preceding 
sensory stimulation, but supplied ‘rationally,’ since it is known 
that some of the words are ‘like’ some of the others. These 
latter groups, then, reproduced fewer items in the case of 
the ‘numerals’ tests than in the ‘words’ tests but reproduced 
them more accurately; this is also true of the ‘No V or A’ 
group, but to a much less degree. 

Fernald believed that tests of the type of the ‘Roman and 
Arabic’ and the ‘Words Alike in Sound, but not in Spelling’ 
tests would distinctly handicap those stressing auditory 
imagery to the exclusion of visual, as well as those individuals 
in whom vocal-motor processes predominate to the exclusion 
of both visual and auditory imagery. ‘The results recorded 
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in Table 2 do not show the dominantly ‘auditory’ individuals, 
at least, to be under much of a handicap in the performance 
of these tests, except possibly in the matter of ‘accuracy’ in 
the case of the ‘words’ tests. Here, while the difference in 
mean total scores is slight between those reporting ‘visual’ 
and those reporting ‘auditory,’ the scores for ‘percent of total 
scores obtained from inaccurate placements’ show the ‘audi- 
tory’ groups (‘A’ and ‘AK’) much lower in accurate place- 
ment of the test items than the ‘V’ group and somewhat 
lower than the ‘VK’ group. In each ‘form’ of this test there 
are always three words different in sound as well as in spelling, 
and if any of these three have been retained in memory, 
through their sound or in any other way, it is possible, as 
noted above, to ‘guess’ the other words which sound like them 
but which are spelled differently, and to insert these at random 
in the unfilled spaces in the letter-square. But these ‘ran- 
dom’ insertions cut down the ‘accuracy’ score, and here 
visual memory has the advantage, as Fernald intended should 
be the case. With the ‘Roman and Arabic’ test, the fact 
seems to be that the Roman numerals mixed in with the 
Arabic are quite easily ‘verbalized’ and remembered in audi- 
tory terms by those to whom auditory imagery is to any degree 
‘natural.’ The line ‘6 IX 4,’ for example, instead of being 
memorized by them as ‘six-nine-four’ is memorized auditorily 
as ‘six-eye-ex-four’ and reproduced accurately in consequence. 
But in the ‘Words Alike in Sound, not in Spelling’ test, the 
short presentation time does not allow words like the following: 


‘cent raise site 
sight raze scent 
rays cite sent’ 


to be spelled out, either auditorily or kinesthetically, so place- 
ment of the separate items, if the words are remembered by 
their ‘sound’ or through more or less implicit vocal-motor 
movements, is necessarily more random than where visual 
images of the appearance of the words aid in their being cor- 
rectly localized. 

In connection with the test, ‘Words Alike in Spelling, not 
in Pronunciation,’ the ratio .65, in the next to the last column 
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in Table 3, when compared with the corresponding figures for 
the other three tests (.88, .79, and .76), points to the probabil- 
ity that the pronouncing of the words by the experimenter 
in this test, along with the visual presentation, was more of a 
distraction to the ‘No V or A’ group than it was to the others. 
If vocal-motor movements predominate in both perception 
and recall for the members of this group (as they report) 
the possible inhibiting and distracting effect of simultaneous 
stimulation of visual, auditory, and proprioceptive sense 
organs is understandable and checks with the results from 
previous investigations. (£.g., Fernald (8), Dallenbach (3), 
Quantz (19), O’Brien (18).) The reports from many of our 
dominantly ‘visual’ and ‘auditory’ subjects, in situations like 
this one, indicate that they exercise the ‘attentive selection’ 
among the ‘various presentative aspects’ of the situation 
noted by O’Brien. The former attend to the visual impres- 
sions and ‘shut out’ the experimenter’s voice. The latter 
may even shut their eyes and concentrate wholly on the audi- 
tory presentation of the stimulus-words, relying on the persev- 
eration of these sound-impressions for success in recall. A 
possible advantage in favor of those concentrating on the 
‘visual’ method of study, in that they might ‘see’ the stim- 
ulus-words several times while those attending to the audi- 
tory stimuli were hearing them only once, might be offset 
if the auditory sensations perseverated longer, or more clearly. 
The data for this test in Table 2 reveal no pronounced supe- 
riority for one of these groups over the other; except the fact, 
previously noted, that all the others are better than the ‘No 
V or A’ group. 

The objective results from these four tests, then, as with 
the two tests previously reported, may be considered as 
offering evidence in support of the validity of the subjects’ 
reports; especially since they are in line with objective indi- 
cations noted by Fernald, using similar tests with more 
sophisticated subjects. 

4. Rhyming Ability Test. This test, with the stimulus- 
words which were substituted for Fernald’s list and with 
visual presentation instead of auditory, proved highly success- 
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ful as an objective check upon the reports of the subjects as 
to the manner in which the words which they put down as 
‘rhymes’ came to them. The clue afforded by the errors 
made by dominantly ‘kinesthetic’ subjects in the first rhym- 
ing test given, seems to have been well worth following up. 
A glance at Table 4 shows how much less discriminating are 
those subjects who report either kinesthesis or visual imagery 
dominant, than those who report that their ‘check’ upon the 
words as rhymes is in terms of auditory imagery. Nine out 
of every ten of the latter make either no errors at all or one 
only: only one out of the fifty-one subjects in this group makes 
six errors or more. In the ‘K and V’ group, there are less 





























TABLE 4 
Percent OF Eacu Imacery Group Makino Errors, RuyMinG TEST 
Dominant Imagery 
Number of Errors AK, AV, or AVK 
A (None Reported K Vv K and V Together 
(nm = 51) *‘Dominant’) (m = 43) (n = 13) (n = 56) 
(nm = 18) 
Cawscewees 68.6 38.9 6.9 23.1 10.7 
Ci it mind eke 21.6 11.1 9.3 0.0 7.1 
ee 7.8 38.9 25.6 7.6 21.4 
2b acai 2.0 5.6 25.6 30.8 26.8 
| re 0.0 0.0 18.6 38.5 23.2 
Sr Ks weencas 0.0 5.6 7.0 0.0 5.4 
TP ssbetasces 0.0 0.0 7.0 0.0 5-4 
Reliability of Test 
4 vs. % = .86 Whole Test = .92 











than two out of ten, on the average, who make only one error 
or no errors, while thirty-four out of 56—approximately 61 
percent of the whole group—make from six to nineteen errors 
each. (For list of words scored as errors see appendix, page 
657.) 

It may of course be argued that the list of stimulus- 
words chosen was not ‘fair’—that it included ‘tricky’ words. 
‘Lose,’ for example, may be habitually misspelled by some 
individuals as ‘l-o-o-s-e,’? and others may be in the habit of 
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spelling the word ‘loose,’ ‘l-o-s-e,’ and the pronunciation may 
be shifted to go with the wrong words. But as previously 
pointed out this attempt to confuse was intentional on the 
experimenter’s part and the point of interest is that whatever 
confusion the experimental situation succeeded in arousing 
was correlated to such a high degree with reports of a particu- 
lar sort—namely those of ‘V’ or ‘K’ as dominant in the per- 
formance of this task. The element of confusion was simply 
not there for the large majority of those reporting auditory 
imagery dominant. As a matter of fact, a glance at the list 
of errors given in the appendix will show that the errors were 
of many sorts and that all four stimulus words were confusing 
to some individuals, as it was hoped they would be. Many of 
these errors seem to indicate that the persons who made them 
are insensitive to sound-distinctions which are perfectly 
definite and clear-cut to those individuals reporting auditory 
imagery dominant in this test. 

It is, of course, quite conceivable that there are individuals 
so sensitive to slight modulations in the proprioceptive stim- 
ulation set up by innervation of the vocal-motor muscle groups 
that distinctions of the sort required in this test could be made 
by them on the basis of their awareness of very slight varia- 
tions in kinaesthesis. ‘Three of my ‘K’ group make no errors 
and four make only one error. This is sixteen per cent of the 
entire ‘K’ group, as contrasted, however, with the ninety per 
cent of the ‘A’ group (forty-six individuals out of a total of 
fifty-one) who find the test such an easy one as such scores 
indicate it must have been for them. 

I find it more difficult to account for three subjects, report- 
ing that they selected the words on a ‘visual’ basis entirely, 
who also scored noerrors. In their cases it seems to me either 
that their introspections must be inaccurate, their verbal 
reports have been entered hurriedly on their papers, and con- 
sequently incompletely or inaccurately, or that they did 
choose ‘rhymes’ purely on a visual basis and by chance hap- 
pened to get words which ‘looked’ like the stimulus word and 
also sounded like it. The last alternative seems improbable, 
even though the number of words which could be put down in 
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a minute in response to a given stimulus-word was not very 
great and the chances for accidental success were consequently 
better than they would have been if the recall-interval had 
been several times as long. 

A comparison of the results of this test, with these subjects, 
with the results of the rhyming test used by Stumberg (25) 
with her ‘poet’ and ‘non-poet’ groups, is of interest. Her 
tests differentiated her subjects, in favor of the ‘poets,’ with 
respect to the total number of ‘right’ rhymes given, when 
her groups were approximately equated for ‘intelligence.’ 
No light was shed by her results upon the cause of the differ- 
ence which she found, and of course the simple fact of unequal 
amounts of ‘practice’ might be adequate to furnish the expla- 
nation in the case of her subjects. (With the stimulus words 
used in the present experiment, and the short time-limits, 
‘total number of “right”? rhymes’ proved not only a less 
reliable measure than either ‘number of errors’ or ‘rights 
minus wrongs’ but also a less sensitive index to both ‘kind of 
imagery reported dominant’ and ‘Alpha’ scores.) But the 
question as to what causes some individuals to ‘go in for’ 
poetry, while others with approximately the same amount 
of ‘g’ are indifferent to it, is of interest from the purely be- 
havioristic standpoint. Except in the case of free verse, the 
ability to find ‘good’ rhymes with ease is probably one of the 
factors predisposing toward an interest in poetry writing. 
The verbal reports of the subjects in the present experiment 
point to the functioning of something which they unite in 
calling auditory imagery as at least one of the important com- 
ponents of ‘rhyming ability.” As in the case of the Seashore 
Test for Tonal Memory this indication that imagery plays a 
part in behavior accomplishments of a particular sort raises 
once more the question which interested Fernald, that of the 
feasibility of developing the possibly latent imagery potential- 
ities of individuals through ‘training.’ ? 

7From a strictly practical standpoint there are probably many who will hope 
that we may remain in ignorance on this point, since (such persons will insist) there 
are too many poets and near-poets turned out even with our present haphazard 


methods of encouragement of them, and society obviously needs a certain relatively 
large proportion of non-poets if its more prosaic needs are to continue to be met. 
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5. ‘Childhood’ Free Association Test. Results from this 
test are included here because they bear on the question as to 
the effect of imagery differences upon overt language re- 
sponses, and because they seem to be in line with individual 
differences in association tendencies noted by Stumberg (26) 
and by Downey (31). Since only one ‘form’ of the test was 
given to the sections which took it no reliability coefficient 
can be reported. But the results from the several sections 
which performed the test were consistently in the same direc- 
tion even though the time limit varied, for different sections, 
from one half minute to two minutes. The results are tabu- 


lated in Table 5. 


TABLE 5 


FrEQuENcY oF Nouns AND Oruyr Parts oF SPEECH IN Lists oF Worps 
WRITTEN IN RESPONSE TO THE StimMuLUS Worp ‘CHILDHOOD’ 




















7 - : Mean No. Other Parts Ratios: No. Nouns + No. 
Mean No. Nouns of Speech Other Parts of Speech 
Time 
Limit 
Reporting Reporting Reporting Reporting | Reporting | Reporting 
V Alone or A, K, VK, V Alone or A, K, VK, | V Alone or | A, K, VK, 
Dominant | AK,orVAK]| Dominant |AK, or VAK} Dominant |AK, or VAK 
4 Min. (n = 22) (n = 11) 
9.05 6.91 1.00 2.18 9.05 3.17 
1 Min. | (n = 35) (n = 24) 
15.51 12.21 1.66 6.88 9.34 1.77 
2 Min. (n = 23) (n = 6) 
30.09 19.67 4.39 19.83 6.85 .99 























It seems clear that a stimulus-word such as ‘childhood’ sets 
off in those individuals who report their associations as ex- 
clusively or dominantly ‘visual’ a tendency to put down words 
which name the particular scene or individuals ‘appearing’ 
to them in their visual images. These subjects consistently 


put down many fewer ‘relational’ and ‘modifying’ and 
‘action’ words than do those who report either something 
other than visual imagery as dominant, or nothing dominant. 
The number of other parts of speech written down increases 
with both these groups as the time limit increases, but the 
relative number of nouns given increases in the case of the 








SIGNIFICANCE OF IMAGERY DIFFERENCES 651 


‘visual dominant’ subjects, decreases with the other subjects, 
as the response period lengthens.® 


II. Test INTERCORRELATIONS 


Tables 6 and 7 show the manner in which the tests already 
reported upon correlate among themselves and with Alpha. 
The most striking thing about both tables is that the inter- 
correlations among the several tests of ‘immediate memory’ 
are no higher than they are. In psychological literature, 
both experimental and theoretical, ‘immediate memory’ is 


TABLE 6 


Test INTERCORRELATIONS 
(1928 Data) 











Roman Words Alike Words 
Arabic and in Sound, not | Pronounced 
Arabic in Spelling and Shown 
ieee kinins wise wala 
Roman and Arabic........ .383 + .06 
Words Alike in Sound, not 
II, 6 han 600 noma 106 + .07 | .221 + .06 
Words Pronounced and 
ich ddemde kwon .284 + .06 | .309+ .06 | .466 + .05 
Army Alpha..............| .0O1 & .07 | .229+ .06 | .258 + .06 | .299 + .06 

















still frequently referred to as though the term really described 
some sort of unitary function. The evidence here presented, 
it seems to me, indicates clearly enough that ‘memory’ does 
not operate independently of the kind of material and the 
manner of its presentation. In the light of the objective 
support accorded the subjects’ verbal reports in the case of the 


®It might have been anticipated that the ‘visual dominant’ group might use 
more adjectives, descriptive of the scenes or persons they were ‘picturing’ to them- 
selves, than would those whose associations were less concerned with concrete ‘ par- 
ticulars.’ This did not turn out to be the case. Adjectives were reported in virtually 
equal proportions by these two groups. There is some slight indication that the 
‘nouns’ cf the ‘Non-visual dominant’ groups tend more toward abstractness than 
do the nouns put down by the ‘visual dominant’ groups. That is, there seemed to 
be a tendency for words like ‘justice’ and ‘kindliness’ and ‘unhappiness’ and ‘friend- 
ship’ to appear more often in the lists of the former than in those of the latter subjects. 
This tendency might manifest itself more clearly with more subjects and longer time 
limits, or with subjects of a different sort than these—say college professors, ministers, 
and lawyers, on the one hand, compared with poets, writers of romantic fiction, 
painters, and possibly the large and growing tribe of ‘mammy-song’ writers, on the 


other. 
43 
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TABLE 7 


Test INTERCORRELATIONS 


(1929 Data) 




















Tonal eometri Words 
Memory ” esau Rew) pin 1 eg 
Se 
Geometrical Figures...............] .2014.06 
Rhymes (R-W)...................] -1924.06 | .195+.06 
Words Pronounced and Shown..... .} .3572:.06 | .205+.06 | .305-+.06 
eee 291.06 | .335+.06 | .622+.04 | .508-++.05* 











* Note: The “words pronounced and shown”’ test was scored differently in 1929, 
in that words were scored as correct if they were spelled in accordance with their 
sound, whether or not this spelling accorded with the words as shown visually. The 
intention was to weight this test still more in favor of auditory recall. In 1928 only 
words spelled like the visually-presented stimulus-words were scored as correct. 
This increased latitude allowed the 1929 subjects may account in part for the higher 
correlation with Alpha which the 1929 scores in this test yield. 


present tests I shall venture to suggest that still other factors 
which may affect memorial accomplishment are (1) the man- 
ner in which a subject ‘attends to’ or ‘perceives’ the presented 
material, and (2) the manner in which he attempts to recall the 
material.® 

These tests, however, were specifically devised, or chosen 
from among tests already in existence, to stimulate different 
modes of attending and recalling, and to measure their 
products in case such differences were uncovered. That they 
succeeded—to the extent indicated by the subjects’ reports, 
by the objective differences in test scores on the parts of sub- 
jects reporting different imaginal processes dominant, and by 
the relatively low intercorrelations—by no means implies that 
other tests of immediate memory could not be found which 
would correlate to a high degree with each other. ‘To find 
tests of immediate memory which will intercorrelate highly 
is patently a very simple matter. All that is necessary to 


® This statement does not, of course, rest on the results from the present experi- 
ment alone. The studies of Warden (28), Scott (20), Husband (12), and Cox (2), 


in the field of human maze learning, the work of Gibson (10), Wulf (30), Messmer 
(16), Katz (13), and Allport (1), on perception and reproduction of visually perceived 
forms, and observations, made by Kuhlmann (14), Segal (23), Sybel (17), Meuman 
(17), Fernald (8), Griffitts (11), O’Brien (18), and others in studies similar in some 
one or more respects to the present one, all point in the direction indicated. 
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achieve such a result is to have the materials used in the differ- 
ent tests similar enough in content, and to keep the method of 
presentation reasonably constant; then inter- and intra-sub- 
ject differences with respect to modes of perceiving and recall- 
ing will scarcely operate to prevent high correlations between 
the tests.!° 

It is not my intention to enter into any thoroughgoing 
discussion of Spearman’s ‘Two-Factor’ theory (24), or the 
various alternative theories to Spearman’s, all of which at- 
tempt to explain the fact that test intercorrelations tend al- 
ways to be positive and that they tend to fall into a hier- 
archical arrangement as regards their magnitudes. The 
correlational data here presented are inadequate to throw 
much light on the ‘group factor’ versus ‘g’ argument. It is 
plain enough, however, that if Army Alpha is here taken as a 
measure of ‘g’ or ‘general intelligence,’ the extent to which 
the latter ‘saturates’ these tests is for the most part very 
slight. Whether the abilities called into play in the perform- 
ance of these tests represent the functioning of specific factors 
more than they do group factors, or the other way around, is 
not clear; it 1s quite clear that there is much more involved 
in their performance than whatever it is that Alpha and sim- 
ilar ‘general intelligence’ tests measure. (This statement 
holds even in the case of the ‘rhymes’ test and the ‘words 

10 This is apparent in the reliability coefficients of the tests now under discussion, 
computed by means of the ‘split halves’ technique. Except in the case of the Seashore 
Test of Tonal Memory (in which case the total score from twenty-five ‘odd’ judgments 
was correlated with the total score from twenty-five ‘even’ judgments) the reliability 
coefficients of all the other memory tests were obtained by correlating scores from 
two short test ‘forms’ against the scores from two other similar test ‘items’ or ‘forms.’ 
(With the Geometrical Figures test it was three versus three.) The separate ‘forms’ 
of any given test were made up of material which was ‘similar’ only in some very 
general way. Still (although, as noted previously, each two successive ‘forms’ were 
separated by three other tests) the ‘whole test’ correlations for the tests treated in 
this way were as follows: .48 (Arabics), .57 (Roman and Arabics), .72 (Words Alike 
in Sound, not in Spelling), .74 (Words Pronounced Alike, Spelled Differently, .79 
(Geometrical Figures), .92 (Rhymes-Errors), .78 (Rhymes-R-W). These coefficients 
are not ‘high’ as reliability coefficients are usually judged, but most of them are 
very high in comparison with the test intercorrelations. For present purposes they 
may also be considered satisfactory reliability coefficients in view of the shortness of 


the tests, the homogeneity of the ‘population’ of subjects, and the fact that the test 
scores are here used for group comparisons, not for prediction from individual scores. 
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pronounced and shown’ test, which show a higher degree of 
relationship with Alpha than do any of the others.) The 
verbal reports of the subjects point to this ‘much more’ as 
being in part, at least, different amounts of facility in the use of 
what these subjects agree to call imagery, of one sort or 
another. 

So far as these tests emphasize retentivity, especially the 
pure perseveration of sensory stimulation, low correlations 
with ‘g’ were to be anticipated. It will be remembered that 
the early constructors of intelligence tests included tests of 
memory span on a priori grounds, and were somewhat discon- 
certed to find that these tests not only correlated to a low de- 
gree with the other sorts of tests hit upon (which would have 
been a point in their favor) but that they also correlated low 
with ‘criteria’ of general intelligence (which was distinctly 
not in their favor). Now Spearman sums up (24) a large 
amount of work by investigators connected with his labora- 
tory, and announces that their findings all point to a ‘ retentiv- 
ity’ factor as tending to ‘diminish the saturation of a test 
with “‘g.””? That is to say, whenever a test stresses retentiv- 
ity, the correlation of that test with measures of ‘g’ goes down. 
Obviously most of the tests here being reported stressed re- 
tentivity; most of them, intentionally on my part, stressed 
‘perseveration, either visual or auditory, or both." The 
tests which to any extent involve ‘eduction of relations,’ or 
of ‘correlates,’ (the ‘rhymes’ test, obviously; to a lesser degree 
probably the two ‘words’ tests, as already noted at the 
time the results from these tests were presented and their 
interpretation considered) are just the ones which accord- 
ing to this theory should be most ‘saturated’ with ‘g.” The 
intercorrelation tables show this to be the fact. ‘Calling up 
rhymes’ may be considered an ‘intellectual’ task insofar as it 
involves the comprehension of the essential conditions for 
‘rhyming’ and insofar as it also involves having a stock of 


For Spearman ‘perseveration’—or ‘inertia’ or ‘persistence’—is the ‘second 
division’ under the general heading ‘retentivity.? These are not hypothetical ‘facul- 
ties,’ as he uses them, but terms descriptive of ways in which test data really behave. 
The low correlations of my more nearly ‘pure retentivity’ tests with Alpha are evidently 
quite in line with Spearman’s evidence on this point. 
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words on hand to select from; it is also ‘sensory’ or ‘imaginal’ 
to the extent that the check upon a given word as a ‘rhyme’ 
or ‘not-rhyme’ is dependent upon fineness of discrimination 
among sounds or movements, sensed or imaged. The former 
factors tend to push up the correlation with ‘g’; the inclusion 
of demands for fine sensory discriminations in a test tend, like 
emphasis upon retentivity, to diminish it. 


Sex Differences 


Griffitts (11) reported small but consistent sex differences 
in auditory imagery in favor of his women subjects. The test 
results in the case of the present investigation reveal no con- 
sistent, or completely reliable, sex differences. Table 8 
presents a comparison of the sexes with respect to their per- 


TABLE 8 


Sexes CoMPARED 
(1929 Data) 
































: : Diff. 
Test n Mean $.D. P.E. Mean M,-—M;: PE. py ier, Lf my 

Alpha 

Males........48 | 151.67 20.57 2.00 7.25 2.55 2.84 

Females...... 731 144.42 19.94 1.57 
Visual Designs 

re 48 28.13 4.37 43 

Females...... 75 27.00 4-44 9% 1.13 55 2.06 
Tonal Memory 

Males........48 35-71 9.11 89 

Females...... 75 36.81 7.72 .60 — 1.10 1.07 1.02 
Words Pronounced 
and Shown 

er 73-30 11.78 1.53 

Females. ..... 75 66.65 11.56 .gO 6.65 1.77 3.74 
Rhymes | 
Right-Wrong | 

ere 27 16.11 8.54 1.11 | 

Females...... 75 13.59 9.44 73 2.52 




















formance in the tests used in the 1929 experimental program. 
(Data from the 1928 tests has not been treated in this way, 
but the 1929 tests contain more tests in which auditory im- 
agery was reported dominant by a large proportion of the 
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subjects than was the case with the 1928 series, so any supe- 
riority in favor of the women in this respect should show up in 
this table, if at all.) The column, Diff. + P.E. Diff., shows 
that none of the sex differences indicated are four times their 
probable error, the usual statistical criterion of an undoubt- 
edly ‘real’ difference. The tests showing the highest critical 
ratios, with the chances in favor of the differences found being 
‘real’ differences, are in favor of the men; but any of the 
differences reported could be due to chance. 


SUMMARY 


1. Reports as to manner of performance of various intel- 
lectual tasks, by the subjects of this experiment, under the 
stated conditions of experimental control, are supported to a 
considerable degree by objective results in the form of differ- 
ences in test scores correlated with differences in the subjects’ 
reports. | 

2. Correlations with ‘intelligence’ as measured by Army 
Alpha are for the most part positive and low, indicating that 
there may be something common between the abilities tapped 
by such ‘imagery’ tests as those here reported and whatever 
Alpha measures, but that there is much more which is specific 
to some or all of the imagery tests, and which is not measured 
by a test of ‘general intelligence.’ 

3. To what extent these abilities which are ‘not intelli- 
gence’ are group abilities, and to what extent they are still 
more narrowly specific in their behavior, is not clearly demon- 
strated in the results from the present experiments. The low 
correlations between Alpha and the immediate memory tests 
which can be considered most nearly tests of retentivity, or 
of retentivity plus sensory acuity, and the higher correlations 
of Alpha with tests like ‘rhymes’ and the ‘words’ tests of im- 
mediate memory (which seem to involve to a greater or less 
extent what Spearman calls ‘eduction of correlates,’ in addi- 
tion to retentivity and sensory or imaginal discriminative 
capacity) are in line with results reported by workers in 
Spearman’s laboratory, and may be interpreted.as supporting 
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the latter’s hypothesis concerning the independent variability 
of such factors as ‘g,’ and ‘sensory acuity’ and ‘retentivity.’ 

4. The high degree of correspondence noted under (1), 
above, between test score differences on the one hand, and 
differences in reports as to the presence or dominance of what 
most of the subjects seemed to have little difficulty in recog- 
nizing as ‘imagery’ of one modality or another, on the other 
hand, lends support to the hypothesis that ‘imagery’ has 
functional reality. If, that is, the ‘image experience,’ or the 
‘image response,’ or however you wish to describe it, is shown 
to have behavior consequences which are objectively demon- 
strable, then there is no longer any ground for the dismissal 
of the image as epiphenomenal, unless the anti-imagist can 
demonstrate a correlation as close, or closer, between the same 
objective behavior differences and the mechanisms which he 
observes and which he claims are adequate for purposes of 
psychological description and explanation. 

5. The results of the present investigation with respect 
to the functional significance of imagery differences are fully 
in line with observations of other experimenters working in 
various fields—maze learning, perception, recall, recognition, 
association, etc. 

6. No completely reliable or consistent sex differences are 
found. 

(Manuscript received September 1, 1931) 


APPENDIX 


Materials Used in the Tests 


Test x. Test of Tonal Memory.—See Seashore (21) and 
(22) for a description of the test, directions for administering 
and scoring, and norms. 


Test 2. Memory for Geometrical Figures—The following 
pairs of geometrical figures were used: 
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VE WO 
ae hL Pa 


Each pair of figures was drawn in India ink on a card measur- 
ing 6 X 4 inches. 


Test 3. Memory for Roman and Arabic Numerals.—The 
four forms of this test were as follows: 





(1) (2) (3) (4) 
6 X 2 3 7 IV 6 Vg VI 7 Xi 
V3 #VIl i2 Vg X 3 VIII 4 #2 VII 
9 8 IV 5 IX 2 2 8 IV 9 X 5 
Test 4. Memory for Arabic Numerals.— 
(1) (2) 
Ir 6 49 9 4. 28 
3 8 17 73 1 5 
5 61 9 2 36 7 
(3) (4) 
3 85 2 4 13 +1 
6 1 47 7 2 98 
8 5 (24 5 9 53 
Test 5. Memory for Words Alikein Sound, Not in Spelling.— 
(1) (2) 
cent rays cite rain two ere 
raise site sent too —reign=s air 


sight raze scent rein to heir 
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(3) (2) 
toad sign mete right pair by 
syne towed meat buy pear rite 
meet sine toed bye write pare 
Test 6. Memory for Words Spelled Alike, Pronounced 
Differently.— 
(1) (2) 
(soo-er) (wound) 
wind lives row sewer wound plead 
(woond) (sloo) 
bass mall _ lives wound top slough 
(sluff) (sow-er) 
row wind bass slough pléad sewer 
(3) (4) 

(teer) 
sOW poll tier dove bow tear 
lead rust sow bow pool read 

(tie-er) (tare) 
poll léad tier tear réad dove 
Test 7. Test of Rhyming Ability—The stimulus words 


were: ‘lose,’ ‘cost,’ ‘loose,’ ‘boss.’ 
Typical ‘errors’ were as follows: 


‘lose’ ‘cost’ ‘loose’ 
nose toast choose 
dose boast whose 
rose most booze 
moose ghost twos 
noose host news 
dude post soothe 
lewd loss tooth 
calaboose moss booth 

toss truth 
sought blouse 
bought spouse 
draught louse 


‘boss’ 
gauze 
because 
cause 
jaws 
Saws 
morose 
dose 
rose 
force 
norse 
indorse 
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THE COMPARATIVE RETENTION VALUES OF A 
MAZE HABIT, OF NONSENSE SYLLABLES, 
AND OF RATIONAL LEARNING 


BY JOHN A. McGEOCH 


University of Missouri 


INTRODUCTION 


The generalization that acts of skill, which are largely non- 
verbal in character, are better retained than are memorial 
materials, which are primarily verbal, occurs frequently in the 
literature of the psychology of learning. It is usually made 
without qualification. The implication is, therefore, that the 
skilled acts are intrinsically the more permanent acquisitions. 
This generalization, if it is valid, has a much greater signifi- 
cance than has been ascribed to it customarily. It is impor- 
tant for theories of mental organization, for theories of the 
peripheral versus the central character of mental functions, 
and more immediately, of course, for theories of learning and 
retention. 

In a previous paper,' this generalization was questioned 
on the basis of a review of the literature on the problem and of 
a direct experimental comparison of the retention values of 
mazes and of nonsense syllables. We were unable to find 
in the literature any direct comparison of the retention of skill 
and of memorial material. The statement that skill is the 
better retained seems to have been made on the basis of a 
comparison of results from entirely different experiments 
which made use of conditions which were far from being com- 
parable. The high degrees of retention found in some of the 
experiments upon typewriting, ball-tossing, and maze learn- 
ing, compared with the relatively low degrees of retention 
yielded by experiments with nonsense syllables and words, 
seem to have figured largely. In most of such experiments 
the subjects were few, the experimental conditions differed 


1J. A. McGeoch and A. W. Melton, The comparative retention values of maze 
habits and of nonsense syllables, this JoURNAL, 1929, 12, 392-414. 
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widely and, in particular, the degrees of learning were un- 
equal, having been almost universally greater in the case of 
skill than in that of memory. There are, moreover, experi- 
ments in which memorial materials are forgotten not much 
more rapidly than are the skills of other experiments. Our 
review of the literature led to the conclusion that no valid 
generalization regarding relative retention values can be 
made from the available experimental work upon the retention 
of skilled acts and of memorial materials.” 

An experimental comparison was made of the retention 
values of mazes of three degrees of difficulty with those of 
nonsense syllable lists, 8, 12, and 16 syllables in length. Each 
maze and list was learned to a criterion of one perfect repeti- 
tion and relearned after one week to a criterion of three per- 
fect repetitions. In terms of an arbitrary criterion, therefore, 
the degrees of learning of the two materials were the same. 
The relative retention values were found to be functions of 
the particular maze and syllable list compared, of the method 
of measuring retention, and of the relearning criterion em- 
ployed. The syllables were clearly superior in terms of trials, 
while in terms of time and errors at least one syllable list was 
equal to, or better than, at least one maze. There was found, 
thus, no evidence for the generalization that skill is better 
retained than is memorial material. 

In a recent paper, Freeman and Abernethy? have held 
that this experiment is limited in two respects. (1) The two 
forms of learning employed differ radically in structure and, 
as a result, it is not possible to say whether the presence of 
overt movement, as such, influences retention. (2) Further, 
“‘the interval between the learning and the re-learning—one 
week—is too short to indicate what differences may exist 
throughout longer intervals.”” With regard to the first limita- 
tion urged, it is, of course, true that mazes and syllable lists 
differ in structure. In an initial examination of the generali- 

2 For a recent discussion of the problem, see W. S. Hunter, Learning: II. Experi- 


mental studies of learning. Chap. 15 in The Foundations of Experimental Psychology, 
ed. by C. Murchison, Worcester, Clark Univ. Press, 1929. 


3F. N. Freeman and E. M. Abernethy, Comparative retention of typewriting 
and of substitution with analogous material. J. Educ. Psychol., 1930, 21, 639-647. 
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zation under discussion, however, it seemed better to employ 
materials of the kind upon which the generalization was, 
though in the absence of direct comparability, based. The 
results of a comparison of such materials under comparable 
experimental conditions would test the generalization on its 
own ground and prepare the way for a more analytic study of 
the specific factors involved. The fact that, under these 
conditions, no radical differences appear indicates strongly 
that the greater dominance of motor components does not 
mean greater retentiveness. It does not, however, constitute 
direct proof of anything save that, under these conditions, 
mazes are no better retained than are syllables. The validity 
of their second point, that a week is too short to be a repre- 
sentative interval, can only be determined by an experimental 
test. This interval was chosen because most retention curves 
are falling relatively slowly and uniformly by the time a period 
of a week has elapsed and there seemed small reason to sup- 
pose that what would be true of relative retention values after 
a week would not be true, in general, after longer periods. 

In an experiment designed to overcome these limitations, 
Freeman and Abernethy had two groups, of 22 subjects each, 
learn a short paragraph which contained every letter of the 
alphabet. One group, previously unfamiliar with the key- 
board of the typewriter, practiced typewriting the paragraph. 
The keyes were covered with blank caps and a large diagram 
was kept on the wall in front of the subject until one errorless 
copy had been made, after which she practiced without the 
diagram. The other group translated the paragraph into 
number symbols. Ona small copy of the diagram used by the 
first group, the letters were numbered according to their 
position on the keyboard, so that for every movement made in 
typing there was a corresponding number. After the subject 
had gone through the paragraph once silently, she recalled 
what she could, speaking the numbers aloud, and the silent 
translation with the diagram and the recall aloud alternated 
until the criterion was reached. At the beginning of each 
subsequent learning period a recall preceded the silent repeti- 
tion. Each group practiced in daily periods of 50 minutes 
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each until a criterion of two successive perfect trials had been 
reached. 

It should be noted that the two materials compared are the 
same in form but differ in the amount of overt movement to 
be learned, and that this is an important comparison to make. 
The conditions of practice of the two groups are not, however, 
formally the same. The typewriting group practiced with 
the diagram until one errorless copy had been made and with- 
out it until the criterion had been reached. The shift from 
guidance from the diagram to independent practice introduces 
a variable the effect of which is unknown. The substitution 
group had the benefits of a recitation after each independent 
practice. The report does not state whether the subjects of 
this group were given immediate information regarding the 
correctness of their recalls, nor whether those of the typing 
group saw what they wrote. In the absence of this informa- 
tion it is difficult to estimate the extent to which the different 
methods of practice may have affected the results. The 
typewriting group required roughly 46 minutes, or one prac- 
tice period, more to reach the criterion than did the substitu- 
tion group. ‘This introduces both more practice, in terms of 
time, and a distribution over a longer time. Freeman and 
Abernethy, however, consider this unimportant. 

After two weeks each group relearned the material with the 
same procedure asin learning. Fight weeks after this relearn- 
ing it was again relearned as before. The saving scores, in 
terms of errors, repetitions and time are very nearly the same 
for the two materials at the first relearning. This is cited as 
being in agreement with the results of McGeoch and Melton. 
At the second relearning, ten weeks after the original learning 
and eight weeks after the first relearning, different results 
appear. The saving scores of the typewriting group show a 
gain over the corresponding scores at the two-weeks relearning, 
while those of the substitution group show a loss. Freeman 
and Abernethy write: ‘The difference is unmistakable. It 
seems to confirm without question the traditional belief that, 
for longer periods of time, learning which involves overt move- 
ments is more permanently retained than is learning which 
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does not involve overt movements. The difference is so great 
that deterioration of the overt habit seems actually to be 
halted after two weeks.’ This is cited as the most important 
result of the experiment and, had it appeared under unequivo- 
cal conditions, it would have been of the first importance. 

Two facts, however, render very doubtful the interpreta- 
tion of the saving scores after ten weeks. (1) Each group re- 
learned after two weeks and again after ten weeks. The 
second relearning has, thus, had antecedent to it both the 
original learning and the relearning after two weeks. (2) 
At the first relearning, the typewriting group required roughly 
86 percent more time for relearning than did the substitution 
group, although the saving scores were roughly the same. 

The superior retention of the typewriting after ten weeks 
may mean that learning which involves overt movements is 
better retained than learning which does not involve such 
movements. There are, however, two other interpretations 
which must be considered and which are very possible ones. 
It may be that learning involving overt movements is the more 
effectively reinstated and refreshed by a distributed practice 
(relearning after two weeks). It may be also that the greater 
amount of time taken to relearn the typewriting was a factor 
in favor of its greater retention at the second relearning. 
Against this it may be argued that the greater time taken to 
learn the typewriting gave it no advantage at the first re- 
learning and hence that the greater time taken to relearn it the 
first time should give it no advantage at the second relearning. 
This is possible, but it is also possible that the effect of greater 
time spent is cumulative or is more potent at relearning than 
at learning. It may be noted that the relative disparity be- 
tween the two materials, in terms of the means, is virtually the 
same at the first relearning as at the original learning. Any 
one of the interpretations suggested above would be impor- 
tant, if established. Under the conditions it seems to the 
writer impossible to guess, with any factual support, which is 
most likely. The only appeal is to experiment. 

It is the implication of the interpretation of Freeman and 
Abernethy that, had we carried our experiment with the 
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mazes and nonsense syllables over a sufficiently long period, 
we would have found the mazes to be the better retained. 
If valid, this would be highly significant and would necessitate 
a radical reinterpretation not only of our results, but of curves 
of retention for motor skills. "Two experiments have, there- 
fore, been performed to test this and to study, as well, the 
influence of an interpolated relearning. 


EXPERIMENT I 


Method.—In this experiment two of the six materials, the easy maze and the 
8-syllable list, used in the McGeoch and Melton experiment have been employed. 
Group I, composed of 22 college students who were entirely unfamiliar with maze 
learning or with nonsense syllable learning by any method, learned each material to 
a criterion of one perfect trial and relearned it after a period of eight weeks‘ to a 
criterion of three perfect trials in succession. Group II, composed of 18 subjects of 
the same kind, learned each material to a criterion of one perfect trial, relearned it 
after one week to the same criterion, and again relearned it after an interval of eight 
weeks from the original learning, this time to a criterion of three successive perfect 
trials. In each group each subject learned both materials. Half of the group learned 
the maze on Day 1 and the syllables on Day 2. Half reversed this order. The order 
of relearning was, of course, dictated by the order of learning. ‘The maze was screened 
from the subject and the usual instructions were employed. The syllables were 
exposed at a 2-second rate on an electrically driven memory drum and were learned 
by the anticipation method. In every way the conditions of the previous experiment 
were duplicated as closely as possible. The same precautions against rehearsal were 
used, and the results have been treated in the same say. 


Results —The mean learning, relearning and saving scores 
for Group I, composed of 22 subjects who relearned only once, 
after 8 weeks, are given in Table I. The time scores are in 
seconds. 

The saving scores are uniformly higher for the nonsense 
syllables than for the maze habit. The superiority of the 
syllables is greatest in terms of trials. The difference between 
the two materials is much less when saving is computed in 
terms of time and errors, but it is still considerable, and is 
greater in terms of errors than of time. In learning syllable 
lists by the anticipation method, the time score is the trial 


4A duplication of the 10-weeks interval used by Freeman and Abernethy was at 
first intended, but it turned out to be impossible to get all subjects through the original 
learning in time to allow more than 8 weeks to elapse without encroaching on the final 
examination period, with the variable conditions which that would imply. The 


difference between 8 and 10 weeks can scarcely affect the general implications of the 
results. 
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TABLE I 


Group I: LEARNING, RELEARNING AND SAVING ScoRES 
Maze Scores 



































Learning Relearning—8 weeks . 
Saving 
Mean ou Mean ou 
Trials err 17.18 2.33 30.50 5-33 —77.53 
Pime........ 572.09 86.99 437-45 78.03 23.53 
ee 130.81 28.99 90.68 21.64 30.67 
Nonsense Syllable Scores 

(ee 24.18 2.50 15.36 1.67 36.47 
EAvOe8....... 119.04 11.68 53-72 5.69 54.87 








score multiplied by the constant exposure time for the total 
list. ‘The saving score is, thus, the same for time as for trials. 

The results uniformly corroborate those of the prior ex- 
periment. Nonsense syllables are retained somewhat better 
than is the maze habit after eight weeks as well as after one 
week. In the present experiment only two of the former 
materials have been used, but there is no reason to believe that 
these two are not fair samples. 

A repetition of this experiment under conditions analogous 
to those employed by Freeman and Abernethy will assist in 
an interpretation of their results and will show the influence 
of an intervening relearning. Group II learned the same 
materials as did Group I, but relearned them after one week 
and again after a total of eight weeks. Any major differences 
between the tendencies of the saving scores of the two groups 
may be ascribed to the interpolated relearning of the second 
group. The first relearning is introduced after one week to 
conform with the interval of the prior experiment. The 
results yielded by this group are given in Table II. 

It will be noted that the means of the learning records of 
Group II are somewhat larger than are those of Group I. 
This difference occurs with both materials, and evidence will 
be presented later that the relative retention values have 
not been altered thereby. The larger scores of Group II are 
produced by a very few slow learners who happened to fall 
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TABLE II 


Group II: LEARNING AND RELEARNING 
Maze Scores 









































Learning Relearning—1 Week Relearning—8 Weeks 
Mean oyu Mean oyu Mean ou 
cee 24.44 4.22 10.05 2.72 19.50 3.82 
cre 661.38 107.07 161.94 35.51 216.72 40.33 
i ree 153.27 23.70 36.72 11.07 35.22 8.55 
Nonsense Syllable Scores 
eee 27.66 3.86 5.55 .76 16.33 2.09 
Errors...... 140.72 20.47 23.50 3.19 53-33 7.61 














in this group and whose records increase the means and the 
variability. 

The saving scores computed from the data of Table II are 
shown below. The saving at the relearning after 8 weeks has 
been computed in terms of the sum of the records for learning 
and for relearning after one week. Since both are required to 
represent the subject’s antecedent practice with the materials, 
both have been included in the denominator in finding the 
saving. The criterial trials of the relearning after eight weeks 
are not, of course, included in the numerator. The relations 
between the two materials are in no essential way altered, 
however, by computation of saving scores with only the learn- 
ing records in the denominator. 


TABLE III 
Group II: Savinc Scores 








Relearning—1 Week Relearning—8 Weeks 


























Trials Time Errors Trials Time Errors 
arr 58.87 75.51 76.04 45.07 73.70 81.56 
dccccsseel §6RGR 79.93 83.30 52.27 52.27 67.52 





The saving scores after the interval of one week show es- 
sentially the same relationships as do those of the preceding 
experiment. The superiority of the syllables is clear when the 
measurement is in terms of trials. In terms of time and of 











670 JOHN A. MCGEOCH 


errors the syllables are still superior, but by amounts too small 
to warrant any conclusion save that the two materials are 
retained almost equally well when retention is measured in 
these terms. 

When, however, we turn to the saving scores after 8 weeks, 
a different picture presents itself. In terms of trials the sylla- 
ble list is still superior, but to a much smaller degree. In 
terms of time and errors the maze is superior by fairly sub- 
stantial amounts. ‘These results must be interpreted in com- 
parison with those of Group I which relearned only after 8 
weeks and which yielded a clearly greater permanence for the 
syllables than for the mazes. The only experimental differ- 
ence between the two groups lies in the relearning after one 
week, which was absent in Group I and present in Group II. 
It may, then, be concluded that an intervening relearning one 
week after the attainment of the arbitrary criterion of one 
perfect trial, strengthens the maze habit more effectively than 
it does the verbal habits of the nonsense syllable list. This 
may be a result either of a differential influence of the distrib- 
uted practice involved in the interpolated relearning, or of 
the relatively greater relearning effort required by the maze, 
or of a combination of the two factors. 

At this point it is necessary to inquire whether the results 
have been affected by unequal amounts of effort in learning. 
A comparison of Tables I and II will show, not only that the 
means of the learning records of Group II are larger than 
those of Group I, but also that the difference is greater with the 
maze than with the nonsense syllables. There is a consider- 
able amount of evidence in the literature to indicate that 
retention is a function of the amount of learning effort ex- 
pended. It might be, therefore, (1) that the relative increase 
in the retention of the maze habit at the relearning after 8 
weeks, in the case of Group II, is a function of the relatively 
greater time, trials and errors required by that group to 
learn, and (2) that the tendency of the maze habit to be re- 
tained better than the nonsense syllables, when there has been 
an intervening relearning, may be a function of these higher 
learning scores rather than of this first relearning. 
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These possibilities have been investigated by selecting 
groups which are paired in some measure of learning. Which 
learning score is the better measure of effort is not certainly 
established, but time seems a representative one in the case 
of these problems. Accordingly, to test the possibilities 
mentioned, 15 subjects, paired as closely as possible in time 
required to learn the maze, have been chosen from each 
group. The mean scores differ from each other by only 18 
seconds, and the trial and error scores for the same subjects 
differ much less than do those for the original groups. The 
same pairing has been done for nonsense syllables in terms of 
trials (which are the equivalent of time scores when the ex- 
posure time per trial is a constant). The means of the trials 
for the two groups of 15 subjects are identical and the means 
of the errors differ by only three points. The saving scores 
for these paired groups are given in Table IV. 


TABLE IV 


Savinc Scores or Groups EquaTep In TERMS OF TIME REQUIRED 
FOR LEARNING 





























Group I 
Relearning—1 Week Relearning—8 Weeks 
Trials Time Errors Trials Time | Errors 
rer — 86.7 22.0 33-7 
Pe wcandsas 45-4 45-4 59.8 
Group II 
Maze....... 48.4 68.0 69.9 25.6 63.9 75.6 
ee 76.3 76.3 79.0 50.9 50.9 64.5 





The same relationships are found with these as with the 
original groups. When there has been no relearning after one 
week, the syllables are uniformly the better retained. They 
are likewise superior when relearning occurs after one weck. 
At the second relearning, after a total of 8 weeks, however, 
the syllables are superior only in terms of trials, while in terms 
of time and errors superiority passes to the mazes. Since 
there has been an equation of groups in terms of time spent 
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in learning, and a close approximation to equation in terms 
of the other scores, and since the syllables are uniformly the 
better retained after 8 weeks with no intervening relearning, 
it is necessary to conclude that the shift toward a better re- 
tention of the maze in the case of Group II must be ascribed 
to the intervening relearning. This has affected the maze 
habit much more strikingly than it has the verbal habit. 

The method of pairing used has equated the time scores 
on the maze in Groups I and II and, likewise, the trial (time) 
scores on the nonsense syllables. That is, it has equated 
Groups I and II on each problem but has not equated prob- 
lems. It is hardly fruitful to discuss in this context the prob- 
lem of equating two such widely differing problems, and the 
conclusions drawn do not require it. As one additional check 
on the results, however, it will be useful to attempt an equation 
of both groups and problems. This has been done in terms of 
trials. ‘Ten subjects have been found whose trial records on 
the maze in Group I can be approximately duplicated by 
subjects on the maze in Group II and in the nonsense syllable 
learning of each group. There are, thus, available four groups 
of 10 subjects each whose mean trial scores are virtually 
identical. The saving scores of these subjects, given in Table 
V, show the same relationships which were found when the 
total groups were considered. The agreement of the saving 
scores obtained from equated sub-groups with those from the 
total groups removes the possible interpretation of the re- 
sults as due to errors of sampling and their attendant factors. 














TABLE V 
Savinc Scores In TERMS OF TRIALS FOR EQuaTep Groups or TEN SuBJECTS 
Relearning—1 Week Relearning—8 Weeks 
Maze N.S. Maze N.S. 
Ee ee —71.5 35.9 
| eee 65.3 71.9 36.5 53.0 

















While results from mazes and syllable lists are not known 
to be applicable without modification to typewriting and sub- 
stitution, as used by Freeman and Abernethy in their experi- 
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ment, or to skill and memory in general, it is a plausible 
hypothesis that in their experiment the superior retention 
of the typewriting after ten weeks, although it had been no 
better than the substitution after two weeks, is a function of 
the intervening relearning. The tentative generalization 
implied here is that an interpolated relearning strengthens 
acquisitions involving overt movement more than it does those 
of the memorial order. Whether it operates directly or 
through the factor of unequal practice at relearning cannot 
now be determined. Granting its truth, it might be argued 
that this supports the generalization that skilled acts have 
greater intrinsic permanence. As usually stated, the latter 
generalization applies only to the fate of acquisitions over an 
interval during which no practice is supposed to have occurred, 
and in this form the hypothesis offered neither supports nor 
affects it in any way. If this hypothesis is correct, it may 
account for the high retention values found after long in- 
tervals in the case of many acts of skill. Most of these skills 
were learned over a long period of distributed practice, and in 
several cases were given one or more relearnings prior to the 
final measurement after a relatively long interval. If it is 
correct, it offers a key to the understanding of several of the 
phenomena of retention. As it stands, it appears to be true 
in the case of a maze versus a syllable list. It is tentatively 
assumed that the result may be generalized to cover the 
classification skill versus memory. 

The results of this experiment are directly at variance with 
the usual generalization that skill is retained better than is 
memorial material. After one week and after eight weeks, 
when no intervening relearning occurs, the syllable list is 
better retained than is the maze habit. Its superiority is 
considerable in terms of trials, slight in terms of time and 
errors. When, however, each material is given an inter- 
polated relearning after one week, the superiority of the syl- 
labels is, at the second relearning after eight weeks, diminished 
in terms of trials and passes to the mazes in terms of time and 
errors. 

It will at once be evident to anyone who compares the means 
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and saving scores of this experiment with those of the previous 
study that there are wide differences. The relative retention 
values of the materials compared are the same in tendency, 
but the magnitudes of the scores from which savings are com- 
puted vary, as do the magnitudes of the saving scores. This 
might seem at first consideration to render doubtful the com- 
parability of the conditions of the two experiments. There is, 
however, one factor which accounts for the difference and 
which renders it unimportant in so far as the present conclu- 
sions go. In this experiment, where only one sample of each 
material has been used, there is little opportunity for transfer 
effects to occur. In the former experiment, three samples of 
each material were used and were learned in a counterbalanced 
practice order such that one sample of each was learned first of 
that material by eight subjects, second by eight others, and 
third by eight others, the whole scheme working out to give 
all samples equal average position. This arrangement renders 
inevitable the possibility of both positive and negative trans- 
fer from sample to sample and is almost certain to produce 
results which are different in magnitude from those which 
would appear if each sample were learned in complete isola- 
tion from any influence by others.® One would not, however, 
expect the tendencies of an experiment in which all samples 
are included and rotated in a counterbalancing order to be 
different from those of an experiment in which each sample is 
learned independently by a separate group of subjects. The 
fact that the relative retention values of this and the former 
experiment agree so completely constitutes effective proof 
that the two methods can yield similar tendencies. 


6 Evidence of the influence of transfer effects within the counterbalanced order 
may be cited from the data of the 1929 experiment. In this experiment the eight 
subjects who learned the easy maze first, with no previous maze experience, required 
an average of 16.62 trials, 676.37 seconds, and 193.62 errors. The eight who learned 
this maze in the last position in the practice order, with prior experience of learning 
and relearning two other mazes, required 5.75 trials, 127.37 seconds, and 34.75 errors. 
Obviously the practice effects are very large. The saving scores are much larger at 
the first position in the practice order than at the last position. This is in agreement 
with the influence of transfer upon retention in the case of rational learning. Cf. 
J. A. McGeoch and V. J. Oberschelp, The influence of length of problem and of transfer 
upon rational learning and its retention, J. Gener. Psychol., 1930, 4, 154-168. 
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EXPERIMENT II 


In this experiment a comparison is made between the retention values of a Warden 
10-alley stylus maze,* and a 10-letter Peterson Rational Learning Problem,’ after 
intervals of one week and three weeks. Each material was learned at a single sitting 
to a criterion of one perfect trial. Half of the subjects learned one material on Day 1 
and the other material on Day 2, and the other half reversed this order. Group I, 
consisting of 20 college students, relearned to a criterion of three successive perfect 
trials after an interval of three weeks. Group II, consisting of 20 college students, 
relearned after one week and again after three weeks, each time to a criterion of three 
successive perfect trials. All subjects were, at the beginning of the experiment, 
entirely unacquainted with the problems used, and all were given the usual instructions. 
The same precautions against rehearsal were observed as in Experiment I. The 
scoring of the rational learning problem in terms of trials, time, logical, perseverative 
and unclassified errors is well known. For the present purpose the three kinds of 
errors have been included in a single mean. The instructions used by Peterson were 
employed. Errors in maze learning were counted as in Experiment I and in the 


1929 experiment. 

The data yielded by Group I are given in Table VI. In 
this and in the following table the mean learning scores include 
the criterial learning trial; the mean relearning scores do not 
include the criterial trials. In computing saving scores, how- 
ever, the criterial trials and time of all preceding learning 
(and relearning, if any) have been included in the denominator 


TABLE VI 


Group I: LEARNING, RELEARNING AND SAVING SCORES 


Maze Scores 



































Learning Relearning—3 Weeks 
Saving 
Mean oM Mean ou 
hb eee aes 20.65 2.12 15.75 1.20 23.72 
BEscesscaccceest SRE 276.57 604.95 59.81 43.08 
eres 83.55 11.71 57-85 6.12 30.76 
Rational Learning Scores 

0 eer 5.50 58 2.85 31 48.18 
ere 464.95 46.30 232.95 26.77 49.87 
Bsc acaessnsees 72.60 10.29 29.70 4.57 59.09 





€C, J. Warden, Primacy and recency as factors in cul-de-sac elimination in a 
stylus maze, this JOURNAL, 1924, 7, 98-116. 
7 J. Peterson, Experiments in rational learning, Psychol. Rev., 1918, 25, 443-467. 
The problem used in the present experiment was: 
A & © D &E F G H 1 J 
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but the criterial scores of the particular relearning at which 
the saving is being computed have not been included in the 
numerator. It is obviously necessary to take into account all 
antecedent practice, whether criterial or not, but it is not 
necessary to include the criterial trials of the particular re- 
learning because the material is assumed to have been re- 
learned by the time the criterion can be met and the criterial 
trials may be thought of as tests of the relearning. 

In terms of trials and of errors, the rational learning prob- 
lem is much better retained than is the maze. In terms of 
time, the difference is small, with the rational learning problem 
still superior. 

Group II in this experiment follows the pattern of the 
corresponding group in Experiment I, but with its second re- 
learning after an interval of three weeks. The 20 subjects 
relearned after one week and again after three weeks. The 
results appear in the following tables. 


TABLE VII 
Group II: LEARNING AND RELEARNING 


Maze Scores 





















































Learning Relearning—1 Week Relearning—3 Weeks 
Mean ou Mean ou Mean ou 
re 28.85 3.81 14.20 1.84 8.65 1.75 
Fee 1630.70 164.52 521.65 83.11 264.70 50.87 
Beeees...... 134.55 15.06 39.00 6.35 19.25 3-40 
Rational Learning Scores 
ae 6.60 55 2.40 37 1.30 22 
a 640.00 57.68 160.55 34.63 74.00 13.79 
es 6408s 95.15 16.21 18. 5.61 8.45 2.49 
TABLE VIII 
Group II: Savinc Scores 
Relearning—1 Week Relearning—3 Weeks 
Trials Time Errors Trials Time Errors 
Maze....... 50.77 68.01 71.01 81.21 88.12 88.90 
Rational L...| 63.63 74.91 80.45 89.16 91.57 92.57 
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The saving scores after the one-week interval, though 
larger than those after the three-weeks interval of Group I, 
show similar relationships. By every measure the rational 
learning problem is retained better than is the maze. Several 
of the differences are small, but their consistency warrants the 
conclusion that after one week and after three weeks the ra- 
tional learning problem is better retained than is the maze 
habit. 

The saving scores of Group II after the three-weeks inter- 
val are uniformly higher than after the interval of one week. 
They are much higher than the corresponding scores of 
Group I which had no relearning after a week. Since the 
variable condition is the interpolated relearning, we must 
conclude that the height of the saving scores of Group II after 
the longer interval is a function of the relearning after one 
week.$ 

Important, also, for the present problem is the fact that, 
after the interval of three weeks, the saving scores of the two 
materials approach each other more closely than after the 
one-week interval. This means, as inspection of Table VIII 
will show, that from the standpoint of retention after three 
weeks the maze has profited more from the relearning after 
one week than has the rational learning problem. This differ- 
ence in the extent to which the two materials profit from the 
interpolated relearning is not as great as that which appeared 
in Experiment I, but it is exactly the same in tendency. In 
the present experiment the interval is shorter, the rational 
learning problem is substituted for the syllables, and the mag- 
nitudes of the scores compared are different. These factors 
are sufficient to account for the smaller differences found. 

The rational learning problem is one in which ideational 
factors figure largely and in which no overt movements other 
than the habituated acts of speaking the numbers are directly 


8 It is possible that the greater learning effort required by this group, as repre- 
sented by their higher learning records, might have contributed to its better retention 
after three weeks. As a check upon this the means for the original learning of Group II 
were made to approach those of Group I by the elimination of the extreme cases. 
The saving scores were then computed from the new means. The magnitudes of the 
scores are slightly changed but the relationships remain unaltered. 
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involved. The greater permanence of this problem com- 
pared to that of the maze habit, after both one and three 
weeks, when no relearning intervenes, constitutes further 
evidence against the current generalization that skilled acts, 
involving overt movements, are better retained than are 
verbal and memorial materials. The fact, moreover, that the 
two materials are considerably closer together in saving values 
after three weeks, when a prior relearning has occurred than 
when it has not, implies that a relearning one week after the 
attainment of the arbitrary criterion of one perfect trial 
strengthens the maze habit more effectively than it does the 
rational learning. ‘This directly parallels the corresponding 
conclusion of Experiment I and supports the interpretation 
offered there of the results of Freeman and Abernethy. 


SUMMARY 


The retention values of the easy maze and the 8-syllable 
list of the McGeoch and Melton (1929) experiment have been 
compared after an interval of eight weeks, both when no inter- 
vening relearning has occurred (Group I) and when there has 
been a first relearning after one week (Group II). 

1. When no intervening relearning occurs, the nonsense 
syllables are uniformly somewhat better retained than is the 
“maze habit. The superiority of the syllables is greatest in 
terms of trials, but is still considerable in terms of both time 
and errors. The failure of these results to substantiate the 
current statement that skilled acts are better retained than 
are memorial materials corroborates the outcome of the 1929 
experiment and fails to fulfil the implication of the results of 
Freeman and Abernethy. 

2. In Group II, the saving scores at the first relearning, 
after one week, show essentially the same relationships as do 
those of the 1929 experiment and constitute an independent 
corroboration of the results of that experiment. The saving 
scores after eight weeks, when a first relearning after one week 
has preceded, present a different set of relationships. In 
terms of trials the syllable list is still superior but to a much 
smaller degree. In terms of time and of errors the maze is 
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superior by substantial amounts. It is concluded that this 
difference between the relative retention values of Groups I 
and I] is a function of the relearning after one week in the case 
of Group II, and that this relearning has strengthened the 
maze habit more effectively than it has the nonsense syllable 
list. It is highly probable, in the light of this result, that the 
superior retention, after ten weeks, of the typewriting in the 
Freeman and Abernethy experiment, although it had not been 
superior after two weeks, is a function of the first relearning, 
after two weeks, and not of the inherently greater permanence 
of motor activities after longer intervals. 

A similar comparison has been made of the retention values 
of the Warden 1o-alley maze and a 10-letter Peterson Rational 
Learning Problem after an interval of three weeks, both when 
no intervening relearning has occurred (Group I) and when 
there has been a first relearning after one week (Group II). 

1. When no intervening relearning occurs, the rational 
learning problem is better retained than is the stylus maze. 

2. In Group II, the saving scores at the first relearning, 
after one week, show a corresponding and consistent superior- 
ity for the rational learning problem. At the second relearn- 
ing, after three weeks, the rational learning problem is still 
the better retained of the two, but the differences are consider- 
ably smaller than in Group I where no intervening relearning 
occurred. This supports the interpretation offered in the 
case of Group II of Experiment I. 

The results of these two experiments agree in finding that 
materials which are verbal and ideational in character are, 
when learned to a uniform criterion, uniformly retained, over 
both relatively short and relatively long intervals, better than 
are maze habits. ‘They agree, also, in showing that a relearn- 
ing after one week strengthens the maze habit more effectively 
than it does the verbal and ideational acquisitions, although 
the factors producing this differential reinforcement cannot be 
determined under the present experimental conditions. They 
agree, finally, in contradicting, within the limits of the ma- 
terials and methods employed, the current generalization that 
skilled acts are better retained than are memorial materials. 
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Although these experiments have found the memorial 
materials used to be retained better than the acts of skill with 
which they have been compared, it is not assumed that the 
memorial materials possess any greater intrinsic permanence. 
It is more probable that their superior retention values are a 
function of experimental conditions not isolated in these ex- 
periments. 


(Manuscript received August 15, 1931) 











MENTAL SET IN RELATION TO RETROACTIVE 
INHIBITION ! 


BY OLIVE P. LESTER 
The University of Buffalo 


SEcTION I. INTRODUCTION 


The fact that the recall of any learned material is inter- 
fered with or decreased by the interpolation of certain forms of 
activity between the original learning and recall has been 
clearly demonstrated in numerous investigations. Psycholo- 
gists have denoted this occurrence by the term retroactive 
inhibition. The investigation under consideration assumes 
the existence of this retroactive phenomenon and then raises 
the question as to the possible relation between inhibition and 
learning set or attitude. More specifically stated, what is the 
influence or effect of the following attitudes on the suscepti- 
bility of learned material to retroactive inhibition? 

1. The expectation of recall. 

2. The expectation of interpolated material before recall. 

3. Information as to the possible effects of interpolated 
material. 

4. The assumption of an active attempt to avoid the detri- 
mental influence of the interpolated material. 

Space does not permit an historical review of the experi- 
mental literature on retroaction, and attitudes at the time of 
learning. ‘To the writer’s knowledge, no work on the relation 
of mental set to retroactive inhibition has been reported, 
though much experimental data on set in relation to learning, 
and recall is at hand. 


SecTIoNn I]. ProcEDURE 


A. MATERIAL 


The memory material employed throughout the investigation, both for the 
original and for the interpolated learning, consisted exclusively of standard lists of 
twelve three-letter nonsense syllables. 





1 From the Psychological Laboratory of The University of Chicago. The writer 
wishes to express her appreciation to Professor A. G. Bills for the helpful advice and 
assistance given by him throughout this experimentation. 
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B. Apparatus 


An electrically driven memory drum, of the type devised by Clark Hull, consti- 
tuted the apparatus. The lists of nonsense material were placed on the drum, and 
the timing device set so as to allow a constant exposure of three seconds for each 
nonsense syllable. Thus, a complete presentation of the twelve syllables necessitated 
thirty-six seconds. The drum continued to rotate until each subject was able to 
give one perfect recitation of the list, i.¢. to anticipate by spelling aloud, in each case, 
the syllable following the one then being presented to him on the drum. The list of 
nonsense material was then considered learned. The automatic control of the presenta- 
tion time left the experimenter free to record the learning data of the subjects. 


C. SuBJEcTS 


The subjects were undergraduate students at the University of Buffalo during 
the college year 1929-30. The data in the main were obtained from freshman men 
and women, although the results on a number of second year students were also 
included. In those instances in which sophomores were used as subjects, only those 
who were not carrying the regular elementary course in psychology were included; 
or, if such students were employed, the experimenter made certain, in each case, that 
class discussions had not, up to that time, involved any material on retroactive inhibi- 
tion, set at time of learning, and the like. This precaution insured that very little, 
if any, knowledge existed on the part of the subjects concerning the problem under 
investigation. 

D. Metuops 

The experiment was performed to answer the question whether directions at the 
time of learning (set) would have any effect upon the susceptibility of learned material 
to retroactive inhibition. Such a problem called for the usual learning and retention 
set-up, 1.¢. the learning of the original material; a time interval between original 
learning and recall, during which the interpolated material was introduced; the subse- 
quent recall of the originally learned material. 

In this experiment, the time interval between learning and recall was twenty-four 
hours; the interpolated learning material was introduced immediately prior to recall. 
The retention of the original material was measured by the anticipation and relearning 
methods, t.¢. the number of nonsense syllables correctly anticipated during the first 
trial of the relearning constituted one measure of retention, and the total number of 
trials necessary to relearn the list to the point of one perfect anticipation of each of 
the twelve succeeding nonsense syllables was taken as the second measure of retention. 

Simple, uniform directions for the recall technique were given to all subjects 
just prior to recall. 

The original learning, the time interval, the interpolated learning, and the recall 
procedure remained the same throughout the experiment. Different directions read 
by the subject at the time of learning introduced the factor of varying set or attitude 
—the one variable in our experimental investigation. These different directions, as 
presented to the subjects, determined the specific conditions of the experiment which 
were as follows: 


Condition I 


Learn O? No I 3 Recall 
No expected recall 





2 Abbreviation for original learning. 
3 Abbreviation for interpolated learning. 
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Condition II 


Learn O No I Recall 
Expected recall 


Condition III 
Learn O I introduced Recall 
No expected recall 
No expected interpolated task 


Condition IV 
Learn O I introduced Recall 
Expected recall 
No expected interpolated task 


Condition V 
Learn O I introduced Recall 
Expected recall 
Expected interpolated task 


Condition VI 
Learn O I introduced Recall 
Expected recall 
Expected interpolated task; S. told 
something of its possible effects 


Condition VII 
Learn O I introduced Recall 
Expected recall 
Expected interpolated task; S. told 
something of possible effects, and 
urged to resist effects 


The printed directions, read by the subjects at the time of learning, with reference 
to the expectation of recall and interpolated learning, were as follows: 


Condition I 


No printed directions were given at the time of learning the original material, 
but the subject was instructed to return the following day for the purpose of learning 
another list. This second learning was actually carried out, but only after the recall 
of the original list. 

Condition II 


“You are going to be required to recall, tomorrow at this time, the list of nonsense 
syllables you are now learning. Do not practice on the list after leaving here.” 


Condition II] 


No printed directions were given at the time of learning the original list, but the 
subject was instructed to return the following day in order to learn another list. The 
learning of this list on the succeeding day, since it constituted, in reality, the inter- 
polated task, was actually carried out, after which the recall of the original material 


took place. 
Condition IV 


“You are going to be required to recall, tomorrow at this time, the list of nonsense 
syllables you are now learning. Do not practice on the list after leaving here.” 
These directions were identical with those of Condition II. 


45 
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Condition V 


““You are going to be required to recall, tomorrow at this time, the list of nonsense 
syllables you are now learning, but, before recalling this list tomorrow, you will learn 
another list of nonsense syllables similar to the list you are learning today. Do not 
practice on the list you learn today after leaving here.” 


Condition VI 


“You are going to be required to recall, tomorrow at this time, the list of nonsense 
syllables you are now learning, but, before recalling this list tomorrow, you will learn 
another list of nonsense syllables similar to the list you are learning today. The 
effect of learning a second list is usually detrimental to the remembering of the first 
list—that is, it is likely to cause confusion. You tend to mix up syllables learned in 
the second list with those in the first list. Do not practice on the list you learn today 
after leaving here.” 

Condition VII 


“You are going to be required to recall, tomorrow at this time, the list of nonsense 
syllables you are now learning, but, before recalling this list tomorrow, you will learn 
another list of nonsense syllables similar to the list you are learning today. The 
effect of learning a second list is usually detrimental to the remembering of the first 
list—that is, it is likely to cause tonfusion. To avoid mixing up the syllables of the 
second list with those of the first list, take special care to keep them separate. Learn 
the first list in such a way that you will not confuse it with the second list. Do not 
practice on the list you learn today after leaving here.” 

In order to reduce to a minimum the variability necessarily accompanying oral 
directions given by an experimenter, each subject was requested to read the directions, 
and then to state his understanding of them. In the event that the subject omitted 
an important point, he was referred once more to the instructions in question. 

It will be noted that each condition of the experiment, with the exceptions of I, 
II, and III, presented more and more information relative to the nature of the experi- 
mental set-up—the expectation of recall, the expectation of the interpolated list, 
further knowledge concerning the possible effects of the interpolated list, and, finally, 
the suggestion to learn the material in such a way as to eliminate or at least to mitigate 
the effects of the interpolated list on the recall of the original list. 

Condition III should furnish data showing what may be called ‘raw’ retroactive 
effects. Particularly is this true if the results of this condition are compared with 
those of Condition I where no interpolated learning was introduced into the experi- 
mental procedure. Then, comparing the findings under Condition III with those 
under Conditions IV, V, VI, and VII, we may possibly draw certain conclusions 
concerning the effect of attitude or set at time of learning on the disruptive effects 
called retroaction. 

Of the two hundred subjects participating in this experiment, twenty-one served 
under both Conditions I and II; twenty under Condition III; thirty-three under Condi- 
tion IV; thirty-four under Condition V; thirty-five under both Conditions VI and VII. 
All of these subjects came to the experimenter on four successive days.‘ On each of the 
first two experimental days, the subjects learned a list of nonsense syllables similar 
to the list of so-called original material to be learned on the third experimental day. 





4 In some instances, one day intervened between the first and second experimental 
days, or between the second and third experimental days; but at no time did more 
than twenty-four hours elapse between the third and fourth experimental days. 
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One of the objects of these two practice sittings was to familiarize the subjects with the 
experimental apparatus and procedure; the other, to provide the experimenter with 
some basis for equating the different groups of individuals. On the third experimental 
day, the subjects learned the nonsense material under the experimental conditions 
specified, and on the fourth experimental day, the recall was measured. 


An average of the number of trials necessary in order to 
learn the first two lists was computed for every subject. 
Using such averages for the first sixty-eight persons partici- 
pating in the experiment, the median, upper, and lower quar- 
tile points were computed. They were found to be as follows: 


Trials to learn 


EE ET re 30.75 
ET eet ees 
ac houeeneeuwedsnetees cues ee 


The range of the average trials to learn was from 12-59. 

Approximately the same number of poor, average, and 
good learners ® were assigned to each group so that, under the 
several conditions, the groups are comparable as to the aver- 
age number of trials necessary to learn the first two lists of 
nonsense syllables. 

It may be argued that two practice trials do not afford 
sufficient basis for a classification of individuals as poor, 
average, and rapid learners. Possibly this argument is en- 
tirely valid, yet it seemed advisable to make some effort to 
equate the groups in as many respects as possible, 1.¢. age, 
sex, college year, and, lastly, learning ability. Perhaps the 
only achievement was a rough classification within the groups 
of those individuals whom Meumann designated as ‘die 
schnellen und die langsamen Lerner.’ 

5 The poor learners were classified arbitrarily as those who learned in more than 
38.00 trials; the average learned in from 25.26 to 38.00 trials; and the good learners 
were those who succeeded in 25.25 trials or less. 

*It was suggested that the groups be equated as to intelligence or possibly high 
school record. The correlation between intelligence, as measured by the American 
Council on Education Test, and the average number of trials to learn a list of nonsense 
syllables was r = + .18; between high school rank and average number of trials, 
r = — .03. The first correlation above was computed on the basis of 185 cases; 
the second, on 203 cases. The decided lowness of these correlations influenced the 
writer against employing high school records or intelligence test scores in equating 


the groups. A larger number of practice trials in learning lists of syllables might 
have raised these correlations, but this seems questionable. 
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All of the learning throughout the experiment was done in 

the same room—a large quiet ofice—with only the experi- 
menter present. No subject was asked to participate in the 
experimental work prior to 12 o’clock. In so far as possible, 
a subject worked at the same hour on the various experimental 
days. 
- The question of experimental controls also enters. When 
sets of directions are being given and their effects studied, 
and particularly when these directions are so arranged as to 
give more and more information concerning the experiment, 
it becomes of considerable importance that a subject, suppos- 
edly working under one set of directions or conditions, is not 
influenced by directions previously given to other subjects. 
In the present experiment, such a difficulty was obviated by 
completing each experimental condition, as far as the number 
of subjects was concerned, before proceeding to the subsequent 
condition.’ 

A phase of the experiment which may be open to criticism 
is the lack of control of the subjects by the experimenter dur- 


7 When it seemed advisable, in order to test the significance of various differences, 
to add to the number of cases under Conditions IV, V, VI, and VII, the rather trouble- 
some situation arose of using subjects under Conditions IV and V, for example, whose 
acquaintances had, possibly, already worked under Conditions VI and VII. This 
difficulty was met in the following manner: Additional subjects under Conditions IV 
and VI were informed that the experimenter was working on an entirely different 
phase of the problem, one under which students had not previously served as subjects. 
Needless to say, subjects were added to Condition IV before additions were made 
to Condition VI. Condition V was held over until the Summer Session of 1930. 
At this time subjects of the same relative ages, and apparently unfamiliar with the 
experimental problem, from the elementary psychology course were employed. Con- 
dition VII, because it gave the most complete information, was not affected. The 
constancy of the average number of trials necessary to relearn the material and the 
number of syllables recalled during the first trial led to the conclusion that the additional 
subjects in Condition VI were unaffected by any other directions. The shift of means 
in the two measures of recall in Conditions IV and V, when added subjects had been 
employed, led the experimenter to run twenty-seven entering freshmen of the 1930-31 
school year under these conditions—thirteen of these under Condition IV and fourteen 
under Condition V. The mean trials of these additional subjects to relearn and 
anticipate syllables resembled more the original twenty cases under each condition. 
This would seem to indicate that those cases in Conditions IV and V, added during 
the Spring and Summer Session of 1930, had been influenced by directions other than 
those given by the experimenter. For this reason, Conditions IV and V, as discussed 
here and in the following section, are each composed of the original twenty cases plus 
the added cases drawn from the entering freshman group of the college year 1930-31. 








MENTAL SET AND RETROACTIVE INHIBITION 687 


ing the twenty-four hours which elapsed between the learning 
of the original material and the recall. Although all subjects 
were directed not to practice on the lists after leaving the ex- 
perimenter, there was no guarantee of their compliance with 
the request. In this regard, the present investigation suffers 
no more than any experiment in which absolute control of the 
subjects during the time interval between learning and recall 
is impossible or impracticable. Whenever directions stating 
that a recall will take place are given, any experimenter 
necessarily throws himself upon the weakness, or perhaps the 
strength, of his human material, and his findings must always 
be evaluated with that uncontrolled factor in mind. Of 
course an experimenter may attempt to insure accuracy by 
asking his subjects whether or not they have practiced on the 
lists of learned material, or he may utilize the more objective 
criterion of a recall performance which, by its obvious per- 
fection, indicates additional practice. Such results would 
necessarily be discarded. However, it seems, in the main, 
practically necessary to rely on the integrity of the subjects 
themselves. 
Section III. ReEsutts 


Conditions I and II were introduced into the experiment 
merely as controls with the view, first, of ascertaining the 
effect of knowledge, or lack of knowledge, of recall on the re- 
tention of a previously learned list of nonsense syllables and, 
secondly, for the purpose of comparing retention in the pres- 
ence and absence of interpolation of similar learning material. 
Table 1, given below, presents the retention in terms of num- 
ber of trials to relearn and the number of syllables recalled 
the first trial for the first three conditions of the experiment: 


TABLE 1 


AVERAGE TRIALS TO RELEARN, AND AVERAGE NuMBER SYLLABLES RECALLED 
First Tria 1n Conpitions I, II anp III 











Ave. No. Ave. No. 
Cond. No. of Cases Trials to P.E. Syllables PE. 
Relearn List Recalled 
ist Trial 
I] 21 6.05 421 4.57 340 
a ” 55.55 525 -400 .132 
B 
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The retention measured in trials to relearn in Condition 
II is significantly greater than the retention under Condition 
I, being 3.05 times the P.E. of the difference. At the same 
time, no significant difference exists between the conditions 
in the number of syllables recalled on the first trial. If one 
assumes that directions at time of learning,—in this case, 
knowledge of a subsequent recall,—establish a set or Einstel- 
lung that facilitates or energizes the learning process, such 
facilitation is not apparent in the amount actually recalled 
under Condition II. However, if one distinguishes as did 
Ebbinghaus between conscious retention and unconscious 
retention, then in Condition I] there is evidence of greater 
unconscious retention. If the learning process and retention 
are thought of as involving basically a modification of neural 
processes, then a saving in trials necessary to relearn is a 
good index of the amount of permanent learning or acquisi- 
tion. ‘The amount of retention expressed by the savings score 
is 74.9 percent in Condition II and 66.6 percent in Condition I. 

Contrasting Condition III with Conditions I and II, the 
disturbing effects of an interpolated list on the retention of 
previously learned material may easily be observed. As was 
to be expected, the differences in retention between Conditions 
II and III are greater than between Conditions I and III. 

Although the present experimental procedure was not 
arranged to test the effect of the temporal position of the inter- 
polated learning on the amount of retroactive inhibition, 
nevertheless, it incidentally brings out clearly the retroactive 
effects of such interpolated learning when it is introduced 
prior to recall. This evidence tends to refute Muller and 
Pilzecker’s perseveration hypothesis as an explanation of 
retroactive inhibition. 

Table 2 illustrates the retention in Condition III compared 
with retention in the other conditions of the experiment: 
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TABLE 2 


AVERAGE TRIALS TO LEARN AND RELEARN, AND AVERAGE NUMBER SYLLABLES 
RECALLED First TRIAL UNDER ConpiTions I-VII or ExpEerRIMENT 











Ave. No.| Ave. No. Ave. No. Percent 
No. of | Trials to | Trials to . Savings | Syllables _ of 

Cond. Cases Learn | Relearn P.E. Score Recalled P.E. Material 

List List ist Trial Recalled 
I 21 24.33 8.14 .540 66.6% | 4.2 322 35-3 
II 21 24.14 6.05 421 79.4% | 4.57 .340 38.1 
Iil 20 25.30 11.35 525 55-2% 400 .132 3.3 
IV 33 24.42 8.21 338 66.49 | 1.18 .180 9.8 
V 34 24.32 8.79 .330 63.9% | 1.23 .138 10.2 
VI 35 22.91 7.48 .288 67.4% | 1.54 152 12.8 
Vil 35 23.48 6.83 .292 70.9% | 3.08 .261 25.7 





























Condition III illustrates what has previously been referred 
to as the ‘raw’ effect of retroactive inhibition. In this con- 
dition, information, directions and mental set, in their rela- 
tion to the phenomenon of retroaction, are at a minimum. 
This may, therefore, be considered the basic condition against 
which the other conditions involving retroaction are compared. 
The effectiveness of directions at the time of learning must be 
measured in terms of the greater amount of retention in those 
conditions where such directions are given as compared with 
the amount of retention in Condition III. 

Condition IV is the first one in which directions are intro- 
duced along with the presentation of the interpolated learning. 
In this condition, the subjects were aware at the time of 
learning that a recall of the original material was to occur. 

The directions given under Condition V led the subjects to 
expect a recall of the original list and also acquainted them 
with the fact that another list was to be learned prior to the 
recall. 

The next set of directions—those employed under Condi- 
tion VI—informed the subjects at the time of learning the 
original lists that a recall would take place, and that another 
list of similar syllables would be learned prior to recall. The 
possible interference effect of this list on the retention of the 
original learning was also stated. 

In Condition VII, the subjects were provided with all of 
the information given in Condition VI, and the following 
statement was added: “‘To avoid mixing up the syllables of the 
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second list with those of the first list, take special care to keep 
them separate. Learn the first list in such a way that you will 
not confuse it with the second list.” 

A glance at the preceding table brings one to the obvious 
conclusion that set or attitude at time of learning, whether it 
be knowledge of recall, knowledge of the presentation of the 
interpolated material, knowledge of its possible effects, or an 
attempt to avoid the detrimental influence of the interpolation, 
has a significant effect on the amount of retention. This in- 
crease in the amount of retention holds true under both 
methods of measurement employed in this experiment. 

A comparison of Conditions J and III, and II and IV 
respectively, shows definitely that retroaction is greater when 
measured in terms of recall than when tested by relearning. 
When interpolated learning is introduced, there is an average 
decrease in recall of 30.2 percent. If the retention measure is 
in terms of relearning, there is an average decline of 12.2 per- 
cent. 

Having established experimentally that mental sets 
brought about by means of directions given at the time of 
learning do augment retention or, contrariwise, retard or re- 
duce retroaction, we encounter the further question: Do the 
directions act differentially on the amount of retention? 
Does a knowledge of recall and expectation of an interpolated 
list increase retention more than a mere knowledge of recall? 
Further, does knowledge of the possible effects of an interpo- 
lated list increase retention more than just the bare expecta- 
tion of sucha list? And, lastly, will the directions, indicating 
that subjects should learn the original list in such a way as to 
prevent the interference effects of the interpolated list, be 
still more effective in increasing recall? 

If, in arriving at conclusions, consideration is restricted to 
those differences between the means that are significant, the 
data seem to indicate little difference in retention between 
Conditions IV, V, and VI. Apparently, if subjects are not 
told to avoid the retroactive effect, 1.¢. to avoid interference 
resulting from the interpolated list, it is of little avail to notify 
them that there is to be such a list or to inform them of its 
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effect. In fact, mere anticipation of the presentation of a 
second list of similar material to be learned, or mere knowledge 
of the possible results therefrom seems no more effective in 
overcoming retroactive effects than knowledge of a recall of the 
original learning. 

This conclusion applies to retention as measured by both 
criteria. Table 3, which follows, presents these differences 
between means in Conditions IV, V, and VI, the P.E. of the 
differences, and the number of times the differences exceed 


the P.E. 


TABLE 3 


DiFFERENCE BETWEEN Means, P.E. or DIFFERENCE, AND Times DIFFERENCE 
Exceeps P.E. 1n Conpitions IV, V, anno VI 














Difference Between ‘ , Times Difference 
Means P.E. of Difference Exceeds P.E. 
Cond. 
Trials Syllables Trials Syllables Trials Syllables 
to Recalled to Recalled to Recalled 
Relearn First Trial Relearn First Trial Relearn First Trial 
IV-V 58(1V) .05(V) 473 .22 1.23 .02 
IV-VI .73(V1) .36(V1) 444 .236 1.65 1.53 
V-VI 1.31(VI) 31(VI1) .438 205 2.99 1.51 























Note: Numerals in parenthesis indicate condition favored. 


However, if one follows the trends in the data presented by 
Table 3, it would appear that the results under Condition V1 
are consistently better than those under Condition IV or V. 
This result holds in both of the measures of retention. Fur- 
ther illustration of this tendency is apparent in the data given 
in Table 2. Comparing savings scores, Condition VI shows a 
retention of 67.4 percent as compared with 66.4 percent and 
63.9 percent under Conditions IV and V respectively. If 
measured by the percent of material recalled, retention shows 
a progressive increase in amount as one proceeds from Condi- 
tion IV to Condition VI. Thus, there appears to be a tend- 
ency for retention to be a function of the inclusiveness of 
directions. 

Condition VI presented the opportunity for more motiva- 
tion in the original learning, though this motivation was not 
established directly by instructions. When subjects are made 
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aware of the possible effects of a second learning situation, as 
in Condition VI, for some, if not all, of them there must be a 
self-induced set to counter-act, if possible, the disturbing 
effect. 

The absence of any very significant differences between 
Conditions V and VI may be explained in part, at least, by 
assuming that the conditions were basically similar for some of 
the subjects under both conditions. Some subjects under 
Condition V, when told of the subsequent presentation of an 
interpolated list, inferred its effect. This was evidenced by 
expression of “Oh, that will mix me up!” Further evidence 
of the same reaction was obtained by asking a number of 
subjects, after they had completed the experiment, what 
ideas as to the effect of learning a similar list of syllables oc- 
curred to them upon reading the printed directions. They 
unhesitatingly replied, ‘““Oh, I thought it would interfere.” 

The possibility also arises that those subjects in Condition 
V, who were not far-sighted or imaginative enough to foresee 
the possible consequences of the directions, benefited no 
more from knowledge of a recall and interpolated list, than did 
those subjects who knew only of a recall. This suggestion 
may account for the absence of any difference in retention 
between this Condition and Condition IV. 

Any experimenter, at this point, finds himself confronted 
by those factors of individual differences which make the stim- 
ulus he controls, and believes to be somewhat objectively 
defined, a variable thing. A set of directions, stating that an 
interpolated list will be introduced prior to relearning, will 
produce in some of even a naive group of subjects the thought, 
apprehension, or attitude of expected interference; in the case 
of other subjects, equally naive as regards such concepts as 
retroactive inhibition and the like, no such outcome is envis- 
aged. It appears to the present experimenter that such con- 
siderations may have eliminated or, at least, dimmed the 
supposedly clear-cut quantitative differences in directions. It 
is for this reason that the trends in favor of Conditions V and 
VI are significant. ‘They seem to transcend the very likeness 
that may have come into the directions. 
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Turning now to a consideration of Condition VII, it will be 
remembered that subjects were here directed to learn in such 
a way that the second list would not be likely to interfere with 
the remembering of the first list. This attitude, induced by 
such directions, was presumably the most active. The follow- 


ing table shows Condition VII compared with Conditions IV, 
V, and VI. 


TABLE 4 


DIFFERENCE BETWEEN Means, P.E. or DIFFERENCE, AND TIMES DIFFERENCE 
Exceeps P.E. 1n Conpitions IV, V, VI anv VII 


























Difference Between : Times Difference 
Means P.E. of Difference Exceeds P.E. 
Cond. _—— 
Trials Syllables Trials Syllables Trials Syllables 
to Recalled to Recalled to Recalled 
Relearn First Trial Relearn First Trial Relearn First Trial 
IV-VII_ | 1.37(VII) i 1.90(VIT) 447 317 3-09 6.00 
V-VII | 1.96(VII) | 1.85(VIT) 441 295 4-44 6.27 
VI-VII 65(VII1) | 1.54(VI11) .410 .302 1.58 5.09 














Note: Numerals in parenthesis indicate condition favored. 


All of the above differences in retention, with the exception 
of the difference between Conditions VI and VII, measured by 
trials to relearn, are significant, and show Condition VII to be 
the superior one. 

The effectiveness of a more active set as represented by 
Condition VII at the time of learning, in relation to retroactive 
inhibition may possibly be more vividly illustrated by a com- 
parison with retention in Conditions I and II. Measured by 
the number of trials to relearn, Condition VII is superior to a 
condition in which no retroactive effects occur; and only 
slightly inferior to that in which a knowledge of recall and no 
retroactive effect occurs. Computing the savings score, there 
is a retention of 70.9 percent in Condition VII; 66.6 in Condi- 
tion I; and 79.4 in Condition II. In terms of the percentage 
of material actually recalled, the retention is 25.7 for Condi- 
tion VII; 35.3 for Condition I and 38.1 for Condition II. 
This latter finding also indicates that the effects of retroactive 
inhibition are greater on the phase of retention which is meas- 
ured by the recall method. 
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Section IV. ConcLtusions AND Discussion oF RESULTS 


Conclusions 


The foregoing experimental project has brought to light 
several findings worthy of summary and discussion. 

1. Directions to the subject informing him what to antici- 
pate at the time of learning,—1.e. presence of a recall; expecta- 
tion of interpolated list; information in regard to the possible 
effects of an interpolated list—any one or all of these pre- 
sented at the time of the original learning very definitely 
minimize the retroactive effects and increase the retention of 
the original material. 

2. There is a consistent trend in the data pointing to in- 
creasing degrees of retention with increasing amounts of infor- 
mation given at the time of learning, although the differences, 
taken individually, do not prove to be statistically significant. 

3. Directions urging the subject to make an effort to avoid 
the interference effects of the interpolated list result in a 
significantly greater retention than is obtained under any of 
the other conditions involving informatory directions. 

4. The presence of considerable retroactive inhibition has 
been shown to exist when the interpolated learning of similar 
nonsense material is introduced immediately prior to a recall 
of the originally learned nonsense material and twenty-four 
hours after the original learning. 

Such a finding, while in no sense new, would seem to fur- 
ther substantiate Robinson’s (5) conclusion that retroactive 
inhibition is not due to the disturbance of a perseverative 
process immediately after learning, but is a function rather 
of some other factor, such as associative interference, due to 
the similarity between the interpolated activity and the 
original learning. 

It might be interesting to duplicate this same experiment 
in all respects except to introduce the interpolated learning 
immediately after the original learning. This would decide 
the question as to whether interference with any hypothetical 
perseverative tendencies would in any way change the results 
here obtained. 

5. Retroactive effects are definitely more detrimental to 
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retention as measured by the recall method, than to retention 
measured by the relearning method. 

The findings of the present investigator corroborate in this 
respect the results of McGeoch (3), and are somewhat similar 
to the early experimental conclusions of Heine (2). However, 
the latter worked with recognition and recall as measures of 
retention, and hence it is only by considering recognition and 
relearning as being somewhat alike as measures of retention, 
in contrast to recall as a measure, that we can compare the 
two results. 

Discussion 


The main outcome of the experiment is, therefore, its 
demonstration that directions establishing a set or attitude 
at the time of learning have the effect of minimizing or cur- 
tailing retroactive inhibition. Some explanation of this 
decrease in so well recognized a phenomenon is necessary. 

The word ‘set’ may be variously defined. For instance, it 
may denote a bodily posture temporarily assumed by an 
organism, 1.¢. the runner toeing the line, or the cat crouching 
for a mouse. Here is an illustration of a preparation for 
action which involves observable changes in the adjustment of 
muscles. But set may also be defined as that mental attitude 
which is the result of specific work instructions, and which 
may or may not involve changes in muscular tonus, or the 
assumption of any particular bodily attitude, on the subject’s 
part. It is this latter definition or meaning of set with which 
we are dealing in this study. It seems probable that the direc- 
tions, acting as a stimulus, established an integrated neural 
pattern which energized or facilitated the learning process 
that was being carried on. 

As Bills (1) pointed out, the dynamogenic influence of 
simultaneous processes on one another has long been recog- 
nized, although the analysis of these processes and of the 
phenomenon of facilitation has not been investigated very 
adequately, either on the subjective or on the neurophysiologi- 
cal side. A theoretical explanation is offered in the drainage 
hypothesis of McDougall which, in rather vague terms, at- 
tempts to account for facilitation by assuming that an on- 
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going activity, such as learning, drafts the neural energy from 
impulses simultaneously aroused in adjacent neural systems. 
When we talk in terms of set we have not advanced very far 
from the Einstellung of Watt and the determining tendency of 
Ach and Woodworth. 

Inherent or implicit in the concept of determining tendency 
is the implication that the tendency operates superiorly to 
reinforce certain reactions, 1.¢. a determining tendency to add 


or subtract or multiply in the case of 2 strengthens one associa- 
tion—7, 3, or 10 respectively—and the implications are that 
such a set would operate in the formation of such associations. 
Peterson (4), in a study on the effect on learning of attitudes 
produced by the knowledge or lack of knowledge that recall 
would be required, obtained differential results for the two 
attitudes. He refers to the ‘will tolearn’ attitude, as discussed 
by Ebert and Meumann, and the fact that this attitude exerts 
an effect on the efficiency of memory. He states, furthermore, 
that particular adjustments of this attitude are manifested, 
1.¢e. if the subject endeavored to recall meanings rather than 
words, a better reproduction of meanings occurred and vice 
versa. In reference to his own experimental data, Peterson 
contends that the additional stimulus which produced the will 
to learn was the statement to students, before giving the list, 
that they would be asked to reproduce the words contained 
therein. This knowledge, Peterson thinks, produced, through 
various neural dispositions, a far more definite response atti- 
tude than was possible when no directions were given. It 
may be that our directions operated in a similar fashion. 

Mention has already been made, in the section devoted to 
the results of this experiment, of the greater retention, meas- 
ured by the savings method, in Condition II than in Condition 
I. Condition II gave knowledge of a recall. Here the atti- 
tude at the time of learning apparently facilitated the learning 
—in the sense of bringing about a more complete acquisition. 
It will be noted that the average number of trials to learn was 
approximately the same, in each case. 

In order that more definite proof might be obtained of the 
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energizing effect of directions on learning and on subsequent 
retention, eleven subjects were given directions similar to 
those under Condition VII and on the fourth experimental 
day were asked only fora recall. The condition thus obtained 
involved a more active learning attitude than any other condi- 
tion in the entire experimental procedure, with the exception 
of Condition VII. The average number ot trials necessary to 
learn the original list was 23.54; the number of trials to re- 
learn, 4.30. Expressed in terms of savings score there is a 
retention of 81.5 percent. ‘This amount of retention is rend- 
ered more conspicuous when compared with the retention of 
66.6 percent and 74.9 percent under Conditions I and II. 

There is also some indication of the facilitating effect of 
directions, apparent in the average number of trials required 
to learn the original lists under the several conditions of the 
experiment. Assuming, as seems justifiable, that our groups 
were approximately equal in respect to the capacity or ability 
for learning meaningless material, the tendency for the aver- 
age trials to learn, to decrease as the directions become more 
complete would seem to indicate the greater effectiveness of 
such directions in energizing the learning. Reference to 
Table 2 in the section on results will make more clear the above 
inference. 

If we assume that the interpolation of similar learning ma- 
terial introduces a disruptive force affecting the retention of 
the original learning, the natural expectation would be that 
the more efficiently learned material would be more easily 
retained under interference conditions. It would also be 
anticipated that the subjects possessing some expectation of 
the requirement of a recall, and of the introduction of inter- 
polated material, would be less influenced by the actual 
presentation of the latter. In other words, preparation for a 
future contingency greatly lessens the startling or upsetting 
effect of its actuality. 

Reference to Table 2 will illustrate that retroactive effects 
are more apparent when the number of syllables recalled is 
taken as a measure of retention. The differences between 
the several conditions of the experiment are also greater when 
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this criterion is employed. This is to be explained largely in 
terms of the interference which occurs when a recall is de- 
manded after the learning of the interpolated material. Con- 
siderable confusion is evident in the lists as recalled at this 
time. For instance, the transfer causing retroaction may be, 
in one case, a shift from original memorizing to interpolation; 
and, in another, a transfer from interpolated to original ma- 
terial. Such interference would make itself apparent during 
the subject’s immediate recall more than in his trials to re- 
learn since the exposure of the original list and the subsequent 
presentations allow an opportunity for the interference effects 
to wear off. 

In our discussion of the active attitude or set, we have 
assumed that it acted directly upon the learning process to 
energize it. But there is an alternative possibility that the 
effect of set or attitude on retention could be explained on the 
basis of a theory of perseveration. Although the present 
experimental procedure, since it entails a lapse of twenty-four 
hours before the recall, does not preclude this hypothesis, the 
writer feels such a theory to be highly improbable.’ It 
might be suggested that subjects under the directions of a 
recall, with knowledge of an impending interpolated list, or 
with an active set to overcome the interference effect of such a 
list, were put in a condition (mild anxiety, certain amount of 
tension, desire for a good performance) that favored the per- 
severating, and ‘setting in’ of these lists of nonsense syllables. 
That is, the directions created a background of mental events 
which might have augmented the perseverative tendencies. 
Such a conclusion is not entirely inconsistent with material 
already at hand concerning a tendency to greater persevera- 
tion under conditions of anxiety, worry or remorse. Watt’s 
theory (7) of reluctancy fits into this interpretation also. 

Although the field of neural organization is still relatively 
uninvestigated, explanations eventually should be forthcom- 
ing; and the present experiment may be considered as a small 
offering along the line of descriptive evidence on the effect of 
mental set in relation to retroaction. 


§It is hoped that future experiments will obtain definite and conclusive data on 
this point. 
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The words of Watt are still a challenge: 


‘“‘If we wish individuals to respond to a series of single words in a specified way 
and they can do this, and firmly resolve to do it, we shall find that no number of 
repetitions will make strong enough the associations between the words of the series 


and other words, so as to bring the wrong words out against the intention of such 
individuals.” 


(Manuscript received April 7, 1932) 
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HABITUATION AND PERSEVERATIONAL 
CHARACTERISTICS OF TWO FORMS 
OF INDIRECT SUGGESTION 


BY CLARK L. HULL ann MILTON C. FORSTER 
Yale University 


I 


Whether two distinguishable natural phenomena are essen- 
tially alike or different is always a nice question. Queries 
concerning essences usually have little meaning and are capa- 
ble of no very significant answer unless the point of view of 
the questioner is known. The prime object of the present 
investigation is to clarify certain ambiguities concerning the 
nature of suggestion as related to hypnosis. Various phe- 
nomena such as hypnosis, direct waking suggestion to move- 
ment, and the forms of indirect suggestion represented by 
Binet’s progressive weights and his progressive lines have 
long been classed without distinction as suggestion. Are all 
these phenomena essentially allied to hypnosis, or are some of 
them fundamentally unrelated to it? Experimental evidence 
is accumulating which raises some presumption in favor of the 
view that hypnosis and direct waking suggestion to movement 
are of the same general nature in the sense that the behavior of 
the two seems to conform to the same basic pattern. It has 
been shown by Krueger (4), for example, that falling into the 
hypnotic trance as indicated by the rate of eye closure is 
positively facilitated by repetition or practice. Hull and 
Huse (3), Williams (7), and especially Patten, Switzer, and 
Hull (5) have shown that direct waking suggestion is also 
positively facilitated by repetition. On the other hand, cer- 
tain a priori considerations lead to the view that the progres- 
sive-increment forms of indirect suggestibility are essentially 
distinct from hypnosis and that one among several significant 
distinguishing characteristics might well be that practice or 
repetition should negatively influence susceptibility to the 
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tendency in question. Some observations made by Williams 
(7, 40 ff.), in the course of an experiment set up primarily to 
solve an entirely different problem, tend more or less to corrob- 
orate this hypothesis. If this distinction can be definitely 
established it will tend considerably to simplify the problem 
of defining suggestion (8, 78) and will ultimately aid in the 
clarification of the relationship of suggestion to hypnosis, at 
present in a distinctly unsatisfactory state. 


II 


It was believed that in the present investigation clear-cut 
results would be more readily obtained with subjects who 
showed an intermediate amount of suggestibility, since sub- 
jects showing complete resistance to suggestion could not 
display any increase in resistance as the result of repetition, 
and subjects showing complete positive response could not 
possibly manifest any change in the direction of increased 
susceptibility through practice. Ina preliminary experiment 
with university students it was found that under the condi- 
tions of the present experiment they were so slightly respon- 
sive to the forms of suggestion employed that they were prac- 
tically useless as subjects. It has long been known, however, 
that children are far more responsive to all kinds of suggestions 
than are adults. Accordingly, the subjects finally chosen for 
the present investigation were boys between the ages of 10 and 
It years. Because they were living in an orphanage they 
were readily available for experimental purposes. Fortu- 
nately the greater part of the subjects had received intelligence 
tests. These scores, supplemented by the careful observa- 
tions of the experimenter, indicated that those children finally 
chosen for the experiment were entirely normal individuals. 

The forms of suggestion chosen for investigation were the 
classical ones originally devised by Binet (1), the two based 
respectively on progressive weights and progressive lines. 
With minor exceptions the apparatus and technique of ad- 
ministration were the same as those employed by Williams 
(7, 17 ff.). The fifteen weights were made from rectangular 
pill boxes, the first four weighing 20, 40, 60, and 80 gr., and the 
remaining eleven weighing 100 gr. each. The boxes were 
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numbered beneath, but all appeared to be exactly alike to the 
subject’s view. ‘They were always arranged in the same order 
on a ledge which was at the height of an ordinary table. 

The subjects were given oral instructions at the beginning 
of each day’s experiment, as follows: 

I want you to lift this weight (pointing to the first weight) and then this weight 
(pointing to the second weight) and tell me whether this one here is heavier or lighter 
or just the same weight as that one (pointing again to the first weight). Then lift 
this one (pointing to the third weight) and tell me whether this one is heavier or 


lighter or just the same weight as that one (pointing to the second weight) and then 
do the same for the rest of these. 


In order to make as certain as possible that the subjects fully 
understood the task, a preliminary practice series was given in 
which the weights commonly used in the Binet intelligence 
tests were used. Practice with these weights arranged in 
different orders was continued until it appeared clear that the 
subject was really reacting to the differences in weight in- 
volved rather than merely guessing at random. Several 
subjects who appeared to react carelessly or automatically 
were rejected. Each subject in the experimental group made 
the series of fourteen judgments in the experiment proper 
three times at the same hour on each of ten consecutive days. 
Nine subjects made the three series in immediate succession, 
but with five, each series was separated from the next by ap- 
proximately six minutes of play. At the conclusion of each 
series of judgments the subjects were encouraged by the ex- 
perimenter, who would say, “That is good! Now do it 
again,” or, ““That is fine! Now do it again and do your very 
best this time,” and the like. 

The apparatus employed in the parallel experiment of the 
progressive lines was the one used by Williams (7, 17). It 
consisted of a board screen in which was cut a rectangular 
hole 2 cm high and 11.5 cm wide. Behind the screen was an 
attachment for mounting a cardboard disk upon which were 
placed at intervals fifteen lines drawn in India ink. The 
lengths of the first four lines were 12, 24, 36, and 48 mm, and 
the remaining eleven, 60 mm each. The disk could be 
revolved so that the lines would appear at the window one at a 
time, in the order given above. The lines were presented 
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uniformly 1 cm from the bottom of the aperture, but to dis- 
courage possible tendencies to stereotyped reaction the lateral 
positions of the lines as they appeared in the aperture were 
systematically varied. It is well known, however, that the 
setting in which a line is viewed greatly influences its apparent 
length. To equate this factor of position, nine different disks 
were drawn up in such a way that each line of the series would 
appear in each lateral position at the window an equal number 
of times. The disks were systematically rotated among the 
several subjects in such a way that for the group as a whole no 
constant error due to position at the window would enter the 
judgments of length at any point in the judgment series. 

The instructions to the subjects in this portion of the exper- 
iment were also given orally, as follows: 

I am going to show you some lines and I want you to tell me if the second line 
that you see is longer or shorter or just the same length as the first line that you saw. 


Then I’ll show you another line, and you tell me whether that one is longer or shorter 
or just the same length as the line you saw just before it. 


A preliminary practice series was given to make sure that the 
subject understood the nature of his task. In the experiment 
proper three series of judgments were given each subject each 
day, and the experiment was continued for ten consecutive 
days, exactly paralleling the procedure with the weights. A 
different sequence of disks was employed on each day and 
with each subject. The same type of encouragement was 
given these subjects as those who served in the weights project. 

The experimental work of the entire investigation was con- 
ducted in a small room at the orphanage. [ach child was 
called into the experimental room and made his judgments 
individually. While awaiting their turn to serve in the experi- 
ment, the children amused themselves in an adjoining room 
with puzzles and games. From such information as the ex- 
perimenter was able to gather, the children appear to have 
been so much engrossed with their amusements that they 
indulged in very little or no conversation concerning the 
experiment. Naturally they were given no hints as to the 
nature of the problem under investigation. 

No child acted as subject on more than one form of sug- 
gestion. 
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III 


In both forms of suggestion under consideration any nor- 
mal subject will judge the second, third, fourth, and fifth 
presentations as greater than the immediately preceding one, 
for the reason that they are obviously so in fact. As the re- 
sult of this preliminary activity, there persists over into the 
remainder of the series a tendency to judge each succeeding 
presentation as ‘heavier’ (or ‘longer’), even though they 
really are all alike. In this lies the evidence of suggestibility. 
Actually, however, the subject frequently also responds with 
‘same’ or ‘lighter.’ Accordingly, the results of fourteen sub- 
jects who completed the weights experiment are presented in 
Table I from all three points of view. An inspection of the 
row of means shows that there is a very marked and consistent 
tendency for the judgments of ‘heavier’ among the 100-gram 
weights to grow progressively less with increasing practice. 
This constitutes a preliminary indication of a decided negative 
practice effect. There is evident also a parallel tendency for 
the number of ‘lighter’ and ‘same’ judgments to become 
greater. 

It is common in determining the extent of suggestibility in 
such experiments to consider only the judgments of ‘heavier’ 
and to ignore more or less completely the other two types of 
responses (2, 341). It is clear, however, that a subject might 
give a fairly appreciable score of ‘heavier’ judgments and yet 
have no tendency to suggestion whatever. He might, for 
example, on a given occasion judge 33 per cent ‘heavier,’ 33 
per cent ‘lighter,’ and 34 percent ‘same.’ Insucha case there 
is just as much reason to conclude that there is suggestion in 
the direction of judging the weights too light as to judge them 
too heavy. Asa matter of fact, such a situation presents no 
evidence of suggestion of any kind. Yet a naive interpreta- 
tion of the ‘heavier’ scores considered alone might lead to the 
completely erroneous belief that a very appreciable amount 
of suggestibility is present. It is only as the three types of 
judgments as a whole are shifted to one side or the other from 
a symmetrical arrangement that any evidence of suggestion 
can appear. By the same token, it is evident that the amount 
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of the tendency in question is measured by the magnitude of 
this shift. 

These considerations led to the development of a method 
for determining from such judgment data the amount of 
suggestibility. ‘The method proceeds upon the assumption, 
usually made in treating reactions of this kind, that all the 
judgments taken together are distributed according to the 
normal ‘curve of error.’ A convenient table of this function 
in terms of the standard deviation may be found in Thorn- 
dike’s Mental and Social Measurements (6, 198). The method 
may be illustrated by the computations for determining the 
composite suggestion tendency on day one. ‘Table I shows 
that this group averaged on this day 76 per cent ‘heavier’ 
judgments, 14 per cent ‘same,’ and 10 per cent ‘lighter.’ 
Since the middle of the area of ‘same’ judgments should coin- 
cide with the middle of the distribution of judgments as a 
whole if there were no distortion, the extent of a distortion is 
shown by the distance from the one to the other in cases where 
they do not coincide. The 76 per cent ‘heavier’ judgments 
occupies the entire 50 per cent on the right wing of the distri- 
bution (Fig. 1) and also the inner 26 per cent of the left wing. 
Reference to the table shows that 26 per cent of the statistical 
population occupies a space in this region of the distribution 
corresponding to .71 ¢ on the base line. Next there is added 
the 14 percent of ‘same’ judgments to the 26 per cent of ‘heav- 
ier’ already on the inner side of this wing, making a total of 
40 per cent. Reference to the table shows that 40 per cent 
occupies a space from the center outward corresponding to 
1.280 on the base line. Thus one margin of the ‘same’ 
judgments falls at .71 0, and the other at 1.28 a, both on the 
left wing of the distribution. The range of the ‘same’ judg- 
ments is consequently the difference between 1.28 and .71, or 
.57¢. The mid-point of the ‘same’ judgments is therefore 
one-half of .57, or .28 o distant from the margin of the heavier 
judgments. But since the margin of the heavier judgments 
is .71 o from the center of the distribution (the normal posi- 
tion for the mid-point of the ‘same’ judgments) the latter is 
.71 6 + .28 ¢, or .gg a from its normal position. Accordingly, 
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.99 o is the extent of the displacement of the mid-point of the 
‘same’ judgments, in this case presumably due in the main 
to the indirect suggestion from the preceding series of weights 
which really were progressively heavier. This outcome of the 
computation shows, then, that on the first day of practice 
there was a very considerable suggestion effect. 
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Fic. 1. A diagram illustrating the method of determining the amount of distortion 
in judgment of lifted weights due to suggestion. See text for explanation. 


Similar computations were made for the remaining nine 
days of practice. These values are shown graphically in 
Fig. 2. An inspection of this graph reveals a typical learning 
curve with the usual tendency to flatten out as practice con- 
tinues. As anticipated, the influence of repetition weakens 
the tendency to suggestion rather than strengthens it. Dur- 
ing the ten-day period the rather strong tendency of the first 
day is gradually reduced until on the last day it is only one- 
third of its original strength.! 

Unfortunately only three subjects completed the lines 
experiment by the technique paralleling that of the weights. 
The results from these individuals are summarized in Table 


1This ignores the possibility that other factors may be operative here. It is 
possible, for example, that the well-known tendency to call the second of a pair of 
lifted weights heavier, observed in the classical experiments concerned with Weber’s 
law, may be partly responsible for the distortion of judgment still present on the 
tenth day. This problem is capable of straightforward solution, but the limited 
purpose of the present investigation does not require it. 
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II. While such a limited quantity of data does not justify 
the statistical analysis given the progressive-weights results, 
an inspection of this table reveals some indications of a nega- 
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Fic. 2. Composite practice curve in responding to the progressive-weights form 
of indirect suggestion, in terms of the o of the distribution of judgments. Note the 
decided negative practice effect. 
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tive practice effect. Final decision on this matter must, 
however, await a more adequate volume of experimental 
results. 
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TABLE II 


SUMMARY OF THE MEAN Scores OF THREE SUBJECTS 10 To 11 YEARS OF AGE ON 
THE ProGREssivE Lines EXPERIMENT, WITH ORAL RESPONSES 















































No. of day I 2 3 4 5 6 7 8 9 | 10 

Mean no. of ‘longer’...... see: §.0| 4.1 | 3.8 | 3.1 | 2.8 | 2.6 | 2.2 | 2.7] 2.2 | 2.8 

Mean no. of ‘same’..............4-. 2.8 | 4.0| 4.0] 4.4] 5.4 | 6.3 | 7.0] 5.8 | 6.8 | 6.0 

Mean no. of ‘shorter’.............+.. 2.2) 1.9] 2.2) 2.4/1.8] 1.1] .8]1.6] 1.0] 1.2 
IV 


It happens that the form of the experiment with the lines 
permits of another and entirely different technique of re- 
cording the subjects’ judgments, which throws some light on 
the nature of the suggestion process. Ten subjects were 
asked, instead of making a comparison of the successive pairs 
of lines with oral responses, merely to draw on a piece of ruled 
paper with a pencil a line equal to each as presented in the 
aperture of the screen. The lines drawn from one disk were 
removed before those on the next disk were presented. The 
instructions to the subject were: 


I am going to show you some lines and I want you to draw a line here which is 
just as long as the line you see. 


Otherwise the experiment was conducted exactly as that de- 
scribed above in which oral reactions were required. 

In view of the decided negative practice effects shown in 
the progressive weights experiment it might perhaps be ex- 
pected that the mean length of the lines § to 15 drawn from 
day to day would gradually become less and less. Actually, 
no such tendency was found. The mean length of these lines 
in mm for the ten days was as follows: 54.1, 54.5, 56.0, 56.7, 
55-1, 54.6, 53.8, 55.1, 55-3, 54-9. 

In an attempt to secure some light on this paradoxical 
situation, we converted the line measurements into scores 
presumably equivalent in some sense to the scores from oral 
reactions to the same stimuli. If the second of two successive 
lines was drawn longer than the first by more than 4 mm it was 
scored ‘longer,’ if shorter by more than this amount, it was 
scored ‘shorter.’ If they did not differ by more than this 
amount they were scored ‘same,’ on the assumption that they 
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would not have been discriminated. The results of this pro- 
cedure are summarized in Table III. An inspection of the 
row of longer values, if considered alone, might easily be mis- 
taken as showing a distinct negative practice effect. A 
careful statistical examination of these converted scores by 
methods already described for oral judgments failed to reveal 
a practice effect of any kind. The indication seems to be 
that this form of progressive lines suggestion is very slightly, 
or not at all, influenced by a very considerable amount of 


repetition. 
TABLE III 
Mean Scores FOR TEN SUBJECTS IO TO II YEARS OF AGE ON THE PROGRESSIVE- 
Lines ExpeRIMENT WHERE THE LINES DRAWN BY THE SUBJECTS WERE 
MEASURED AND THEN CONVERTED INTO THE EQUIVALENTS 
oF VERBAL Reactions, ASSUMING THE THRES- 
HOLD OF DISCRIMINATION TO HAVE 












































BEEN 4 MM. 
No. of day I 2 3 4 5 6 7 8 9 10 
Mean no. of ‘longer’....... 3-90 | 3.47 | 3.40 | 3.23 | 3.23 | 2.77 | 2.70 | 3.03 | 2.97 | 2.53 
Mean no. of ‘same’........] 3-40 | 4.27 | 5.13 | 5-27 | 5.53 | 5.90 | 6.10 | 5.97 | 5.83 | 6.60 
Mean no. of ‘shorter’... ...}| 2.70 | 2.27 | 1.43 | 1.50 | 1.23 | 1.40] 1.20] 1.00] 1.17] .90 
V 


We may next inquire what evidences the two forms of 
indirect suggestion present of perseverational tendencies. 
Pursuing the analogy of the Patten, Switzer, and Hull experi- 
ment (5), we will first examine the relation of the strengths of 
the suggestion tendencies on the three tests per day through- 
out the practice series. Too few subjects were included in 
the progressive-lines experiment by the oral-response method 
to make profitable an examination of these data for this 
purpose. The statistically treated results of the judgments 
of the fourteen subjects in the progressive-weights experiment 
are summarized, from this point of view, in Table IV. 

An inspection of the main body of this table shows that 
if any tendency to perseveration from one series of judgments 
to the next exists it is exceedingly slight and unstable—the 
means for the three trials interlace so much that it would be 
difficult to distinguish one from the other if they were to be 
plotted as were the results of the direct suggestion by Patten, 
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TABLE IV 


SUMMARY OF THE ComposiTE AMOUNT OF SUGGESTIBILITY OF 14 Boys By Days on 
THE PROGRESSIVE-WEIGHTS EXPERIMENT FOR THE THREE 
RESPECTIVE TRIALS Eacu Day 


The unit of measurement is the standard deviation of the judgment distribution. 














Means | Means | Means 

Day I 2 3 4 5 6 7 8 9 10 | days days days 

’~“3 6-10 sso 

First trial......| .87] .91 | .83 | .66| .43 | .47] .39| .36]| .341.31] -74 37 .56 
Second trial....] .97 | .85| .69| .52| .61]| .45 | .26].33| .40].38] .73 .36 55 
Third trial. ... .| 1.18 | .84] .88] .81].40].52|.44|.54|.23|.42] .82 43 .63 












































Switzer, and Hull. Something roughly approximating this, 
however, may be seen at the right of the table where the 
pooled results of the first and second five-day periods of the 
practice are given. ‘There emerges here a certain consistency, 
though the second trial shows a slight shift in a direction 
opposite that of the third. The amount of the tendency to 
perseveration is slight, only .o7¢ where the initial score of the 
day was .730 or less than 10 percent. It may be noted, inci- 
dentally, that this change within a day’s practice is in the 
direction of an increase in suggestibility. This is opposed 
to the tendency of change from one day to the next, which 
is in the direction of a decrease. This means that none of 
the mean (negative) practice effect in this function takes 
place during the practice itself, but seems, somehow, to take 


place during the twenty-four-hour interval between practice 
periods. | 


VI 

The form of the progressive-increment suggestion experi- 
ments permits of a second and fairly distinct quantitative 
approach to the phenomenon of perseveration. As already 
pointed out, the basic mechanisms of the suggestion responses 
in these cases presumably is the tendency of the organism 
which has reacted in a certain way to a series of gross differ- 
ences to persist or perseverate into a stimulus situation where 
these reactions are no longer appropriate. But since the 
series goes on for a considerable distance after the termination 
of the exciting cause, it might be anticipated that the tendency 
to error would be at a maximum at the sixth presentation 











712 CLARK L. HULL AND MILTON C. FORSTER 


and that it might gradually fade out as the successive judg- 
ments were made. In that event, a graph representing the 
magnitude of the suggestion effect through the portion of the 
series where the presentations were actually equal ought to 
represent the course of the perseverational tendency. Such 
a graph for the fourteen subjects of the progressive-weights 
experiment is shown in Fig. 3. Superficially the persevera- 
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Fic. 3. Graph showing the composite tendency to judge the several weights of 
the series heavier than the one preceding. The fading out of the perseverational 
tendency is shown by the gradual approach to the horizontal of the graph for lines 6 
to 15. 


tional tendency is represented (for weights 6 to 15) by the 
distance from the graph to the base line. This tendency is 
nearly half as great as that on weights 2 to 5, where differences 
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of 20 gm. actually exist. The gradual approach of this curve 
to the horizontal shows the progressive fading out of the 
perseverational tendency. As already pointed out, it is 
probable that a portion of what appears to be suggestion here 
in reality is due to a tendency to call the second of a pair of 
weights heavier regardless of suggestion. 

A still different view of perseveration may be obtained by 
plotting the composite curve of the lengths of the fifteen lines 
as drawn by the ten subjects on each of the ten days of 
practice. Such a graph is presented in Fig. 4, where it may be 
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Fic. 4. A graph representing the perseveration of the composite tendency to 
draw successive lines longer after the lines to be reproduced no longer increase. The 
first five lines presented were consistently underestimated. If the remaining lines 
had been underestimated in the same way, lines 6 to 15 would have fallen along the 
broken line labeled ‘no-suggestion level.’ The tendency to increase the length of the 
lines gradually fades out with successive trials. 


compared with the lengths of the lines as actually presented. 
There it may be seen that though the subjects as a group 
tended to underestimate the lines presented, their drawings 
increase in length with remarkable uniformity up to the fifth. 
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At this point, if there were no influence of suggestion, the 
curve presumably should have become horizontal, 1.¢., it 
should have followed the course shown by the broken line. 
Instead we observe that the sixth line is increased nearly half 
as much over the fifth as the fifth is over the fourth—surely a 
very considerable influence of perseveration. In general the 
tendency to increase the lines persists through the entire series 
of ten reactions, though, as might have been expected, the 
tendency grows progressively weaker with the increased dis- 
tance from its source, just as was observed in the case of the 
progressive weights. The remarkable smoothness and uni- 
formity of this graph is distinctly convincing as to the genuine- 
ness of the tendency and its point of origin. 


VII 

In summarizing the results of the present experiment it 
may be said that the two forms of indirect suggestion investi- 
gated definitely differ from the forms of hypnosis and direct 
waking suggestion so far examined, in not showing a positive 
practice effect. The progressive weights display a very 
marked practice effect, but in a negative direction. It is 
possible that the suggestion responses to the progressive lines 
by the oral technique also are influenced in a negative manner 
by repetition, but the quantity of experimental data available 
is not adequate to permit of anopinion. The progressive lines 
by the reproduction technique appear to be nearly, if not 
quite, destitute of any practice effects in either direction. 

In respect to perseveration effects, it may be said that the 
progressive weights manifest a slight tendency to yield 
greater suggestibility from trial totrial. In this respect there 
is a resemblance to the results on direct postural waking sug- 
gestion. In the latter case, however, this tendency was in the 
same direction as the permanent practice effects and may be 
nothing more than the transitory component of such practice 
(5). The case of indirect suggestibility, however, presents the 
paradox of the perseverational or immediate practice effects 
being in a direction opposite that of the permanent effects. If 
this apparent tendency should prove to be a genuine one, it 
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will assist materially in differentiating perseveration as an 


experimental entity distinct from the phenomenon of ordinary 
practice effects. 


The bearing of these results, particularly those concerned 
with permanent practice effects, seems rather definitely to 
separate indirect suggestion from the family of reactions akin 
to hypnosis. This should greatly simplify the problem of 


defining suggestion as related to hypnotism—a thing greatly 
to be desired. 


(Manuscript received June 4, 1932) 
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FACTORS DETERMINING THE RELATIVE 
EFFICACY OF THE WHOLE AND PART 
METHODS OF LEARNING 


BY ALONZO J. DAVIS ann MAX MEENES 
Howard University 


PROBLEM 


Investigations of the whole and part methods of learning 
have yielded conflicting results. Some have shown the whole 
method superior, others the part method.! 

With the emphasis on statistical reliability of results, 
conclusions on the basis of results of investigations which 
have shown the pure part method superior to the whole and 
progressive part methods have been thrown open to question. 
With the application of criteria of statistical reliability, all 
experiments in the field of verbal learning which are statisti- 
cally reliable have shown that the whole method and modifica- 
tions of the pure part method are superior to the pure part 
method.? 

Investigators have not satisfied themselves simply with 
description of superiority. ‘They have sought to find out what 
factors are responsible for this superiority both by hypothesis 
and experimentation. 

Pechstein subjected to experiment several hypotheses in 
an endeavor to explain why the whole method and modifica- 
tions of the pure part method prove superior to the pure part 
method.? His results were negative. Hence he rejected the 
hypotheses which he had tested and formulated a new hy- 
pothesis: namely, that all the waste in pure part learning oc- 
curs in the act of connection. This, he says, is traceable 
almost entirely to the influence of place association. 


1McGeoch, G. O., Whole-part problem: General review and summary, Psychol. 
Bull., 1931, 28, No. to. 

2 McGeoch, G. O., tbid. 

3 Pechstein, L. A., Whole os. part methods in motor learning, Psychol. Monog., 
1917, 23, No. 99. 
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In defining place association, he says, “‘ Place association 
refers to the definite location of an element of a problem in 
reference not only to the remaining details of that problem, 
but to the entire environment. In the case of rote learning a 
certain syllable is learned in reference to its antecedent and 
consequent (immediate association) and to the remainder of 
the terms (mediate association). It is hereby assigned a 
definite position in the word series. This places it in a con- 
ceptual scheme. It is located at a definite number of syllable 
intervals from the introductory term and the terminal one of 
the list. It is reached after the same time expenditure in each 
trial and is followed by a constant time span for the comple- 
tion of the presentation. Both spatial and temporal factors 
are concerned in establishing the positional relationships.” ‘ 

In pure part learning the factor of place association does 
not operate so effectively as in whole learning or in modifica- 
tions of the pure part method of learning. The superiority 
of the whole method and modifications of the pure part method 
Pechstein attributes to this difference in effectiveness of 
place association. 

Crafts has more recently attempted to test the hypothesis.® 
In his investigation of card sorting in which non-serial reac- 
tions are involved, hence in which there is no place association, 
or only a negligible amount of it, he found the whole method, 
and modifications of the pure part method to be superior to 
the pure part method. Where place association does not op- 
erate the pure part method—it would seem—should at least 
prove of equal efficacy if Pechstein’s hypothesis is correct. 

Brown, working in the field of verbal learning, found the 
whole method to be superior to the pure part method.® He 
used the paired associates method in which there were ‘tied 
up’ two-syllable nonsense words with English words. His 
test series consisted of the elements in a ‘deranged’ order. It 
would seem that if place association operated in the learning 
of the material by the whole method, the whole method would 

4 Pechstein, L. A., op. cit., p. 29. 

§ Crafts, L. W., Whole and part methods with non-serial reactions, 4m. J. Psychol., 
1929, 41, 543-563. 

* Brown, W., Whole and part methods in learning, J. Educ. Psychol., 1924, 15, 
229-233. 
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have proved at a disadvantage where the place associations 
were not operative. 

In the two experiments cited, the emphasis has been placed 
on the reproduction of the material to be learned, or more 
specifically, on the test of reproduction. Would the same 
results obtain if the materials to be learned by the whole and 
pure part methods were equated with regard to the factor of 
place association? We shall attempt to answer this question 
by experimentation. 

Other factors have been mentioned as probably determin- 
ing the relatively greater efficacy of one method than the other.’ 
Hunter mentions the character of the stimulus material, 
individual differences, the subject’s age, the amount of train- 
ing which he has had, and some factors as yet undetermined, 
as operative.® 

With reference to the factor of individual differences, Reed, 
on the basis of his investigation in which the part method 
proved superior to the whole method came to the conclusion: 
‘The most economical method of learning is in the last analy- 
sis a question for the individual learner.” ° 

Crafts mentioned a list similiar to Hunter’s.!° In addition, 
he names the subject’s habitual method and the subject’s 
learning capacity. McGeoch has recently reported an investi- 
gation, in which she compared the abilities of gifted and nor- 
mal children to learn by the whole, progressive part and pure 
part methods. She concluded: “The I. Q. is a factor condi- 
tioning the relative efficiency of the whole, progressive part 
and pure part methods in learning. Its efficacy as a factor is 
apparent, at least, in a comparison of gifted and normal 
children.” #! 

McGeoch has recently made a critical review of all data on 
the problem.” She listed the following factors: 


7 Hunter, W. S., Experimental studies of learning, in Murchison, C., ed.: Founda- 
tions of Experimental Psychol., 1929, 564-627. 

§ Hunter, W. S., op. cit. 

® Reed, H. B., Part and whole methods of learning, J. Educ. Psychol., 1917, 2, 
315-346. 

10 Crafts, L. W., op. cit. 

4 McGeoch, G. O., The intelligence quotient as a factor in the whole-part problem, 
J. Exper. Psychol., 1931, 14, 358. 

12 McGeoch, G. O., Whole-part problem: General review and summary, Psychol. 
Bull., 1931, 28, No. 10. 
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1. Subjects: age, training, memorizing ability and intelligence. 

2. Material: type, nature, difficulty and length. 

. Practice: amount, distribution and nature of practice 
periods. 

Form of practice periods used. 

Method of measuring learning efficiency. 

Method of measuring retention efficiency. 

Length of interval. 


Ww 


v PY? 


Reed, who found the part method superior to the whole 
method, suggested that the part method adjusted the material 
to the memory span of the subject.” 


The present experiment is limited to a study of the follow- 


ing factors: place association, sex, age, capacity and habitual 


method of learning. 
SUBJECTS 


The subjects were 104 college students enrolled in courses in psychology. The 
subjects were divided into two groups to be known as Group A which consisted of 
54 students, and Group B which consisted of 50 students. 


APPARATUS AND MATERIAL 


(a) The Malmberg Exposure Apparatus was used for the presentation of the 
material. This apparatus permitted a uniform rate of presentation of the material 
and also an equal total time for the material presented by the whole and part methods. 

(b) The material consisted of 264 stimulus cards which were printed uniformly 
and legibly. The stimulus cards were 10 X 7 inches. The letters were 34 inch X 1 
inch. Upon each of 240 of the stimulus cards was printed a simple di-syllabic English 
word and a nonsense syllable. The nonsense syllable was printed above the English 
word in each case. Upon each of the remaining 24 stimulus cards was printed one 
of each of the di-syllabic English words referred to above. 

The di-syllabic English words were chosen so as to form two sentences. The 
words used in Series I (referred to below) when arranged in the serial order of a sentence 
shall be referred to as Sentence I. The words used in Series III (referred to below) 
when arranged in the serial order of a sentence shall be referred to as Sentence II. 

The stimulus cards were arranged in 8 series. Series I consisted of 12 sets of 
stimulus cards, each set of which contained 10 stimulus cards. Each stimulus card 
of a given set bore the same English word and nonsense syllable. Series II was 
composed, like Series I, except that a different group of English words and nonsense 
syllables was used. Series III consisted of 1 card from each of Series I arranged in 
the serial order. Series IV consisted of 1 card from each set of Series I] arranged 
in the serial order of Sentence II. Series V consisted of 1 card from each set of Series I 
arranged in a non-serial order determined by chance. Series VI composed of 1 card 
from each set of Series II arranged in a non-serial order determined by chance. Series 





13 Reed, H. B., Repetition of Ebert and Meumann’s practice experiment on 
memory, J. Exper. Psychol., 1924, 15, 107-115. 
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VII consisted of the English words only of Series I arranged in the serial order of 
Sentence I. Series VIII consisted of the English words of Series II arranged in the 
serial order of Sentence II. 


Sentence I and its nonsense syllable associates: 


NIF TOH VED WEC BAJ HEG 
DAWSON COUNTY BANKERS INFORMED OVER NINETY 
FEH KAG LAR MOF BOC VUT 


COLLEGE STUDENTS ABOUT KEEPING DAILY BUDGETS 


Sentence II and its nonsense syllable associates: 


HAJ FUT DOP LUM REB KOL 
CLEVER SAXON OUTLAWS DECEIVED MANY SECRET 
CEK NUG FOM LUD KEV MIB 
POLICE AFTER HAVING STOLEN SIXTY HORSES 


(c) The Otis Self-Administering Test of Mental Ability was given to all subjects. 
(d) Test booklets were constructed of fifteen numbered sheets of paper for attempts 
at reproduction of the nonsense syllables. 


PROCEDURE 


A week before the experiment proper, the Otis Self-Administering Test of Mental 
Ability was given to the subjects. After the mental test they were asked to indicate 
their habitual method of learning by writing either the word ‘part’ or ‘whole.’ This 
information was obtained only after a definition by example of the whole and part 
methods of learning had been given. Record was not kept of those who used neither 
method alone. During the same period the subjects were made familiar with the 
paired associates method of presentation by exposing on the Malmberg Exposure 
Apparatus some standard cards (cards which accompany the apparatus, obtained 
from C. H. Stoelting Co.). Tests of reproduction were made by covering up with a 
piece of cardboard one of the associates on each stimulus card and by having the 
subjects reproduce them when their paired associates were exposed. 

A week later Group A was presented with Series I by the part method, the English 
words being in the serial order of Sentence I. Group B was presented with Series III 
by the whole method. A week after this, Group A was presented Series IV by the 
whole method; Group B was presented with Series II by the part method, the English 
words being in the serial order of Sentence II. By this arrangement the benefits 
accruing to both the whole and part methods after practice were equalized. 

The data from the two sentences presented by the whole method were pooled; 
the data from the same two sentences by the part method were pooled. By this 
arrangement, any difference in the difficulty of the material is assumed to affect the 
whole and part methods equally. The diagram shows how the data were pooled 
and the order of presentation. 





Group A—| Part Method—Sentence I 








Group A—| Whole Method—Sentence II 
Group B—}| Whole Method—Sentence I 




















Group B—| Part Method—Sentence II 
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Part METHOD 


Series I was presented to Group A. The Malmberg Exposure Apparatus was so 
regulated that each stimulus card was exposed 0.5 second. The subjects were directed 
to learn the nonsense syllable associate of each English word in such a way that they 
would be able to write them down in their test booklets when they were presented 
with the Test Series, Series VII. When all of Series I had been presented, the Test 
Series was presented. The cards in this series were exposed by the experimenter who 
held them to the view of the subjects to a practiced count of about five seconds. 
The subjects were asked to reproduce the nonsense syllables on sheet one of the test 
booklets. 

Series II was presented to Group B in the same manner as Series I was presented 
to Group A. The testing was likewise the same, with, of course, the appropriate 
Test Series, Series VIII. 

Immediately upon completion of the test of reproduction, after exposure of the 
material by the part method, Series V or VI was presented. Series V followed Series I; 
Series VI followed Series II. It is to be remembered that Series V and VI consisted 
of cards arranged in a non-serial order determined by chance shuffling. Presentation 
and tests continued until fifteen attempts at reproduction had been made in all. 
The number of trials necessary for complete mastery of the nonsense syllables was 
scored and the average calculated. The second consecutive trial upon which a perfect 
reproduction occurred was considered as complete mastery. 

The part method together with the whole method consisting of the stimulus 
cards arranged in a non-serial order shall hereinafter be referred to as the combination 


method. 
Wuote Metuop 
Series III, a set of stimulus cards the English words of which composed Sentence I, 
was presented 10 times in succession toGroup A. This was followed by the appropriate 


test series. The rate of exposure was the same as in the part method. ‘The directions 
were also the same as in the part method. 


Series IV, a set of stimulus cards the English words of which composed Sentence II, 
was presented to Group A in the same manner as Series III was presented to Group B. 
The test procedure was likewise the same. 


Immediately upon completion of the test of reproduction, after exposure of the 
material by the whole method, the same materials to be learned were presented again 
and again by the whole method. It is to be remembered that these cards were pre- 
sented in the serial order of the sentence. Each presentation was continued until 


fifteen attempts at reproduction had been made in all. The scoring and calculating 
were the same as above. 


Protocols were obtained from about one-fifth of the subjects. 


TABULATION OF RESULTS 


I. The whole method proved superior to the part method 
on the criterion of the number of errors on the first attempt 
at reproduction. 

The whole method also proved superior to the part method 
on the criterion of the number of individuals who found one 
method superior to the other. The number of individuals 
for whom the part method proved superior is 20. The num- 
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CoMPARISON OF WHOLE AND Part METHops 














Method Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
ree 9.77 16 
1.83 21 
I i. eeepc hie otal 7.94 25 

















ber of individuals for whom the whole method proved superior 
is 65. The number of individuals for whom the two methods 
proved equally difficult is 19. 

The whole method proved superior to the combination 
method on the criterion of the number of trials necessary for 
complete mastery; but only by a small and unreliable differ- 


ence. 
CoMPARISON OF WHOLE AND CoMBINATION METHODS 














Method Av. No. Trials P.E. Av. Diff. in Av. P.E. Diff. 
cee hicid knee eras 8.24 20 
“or -57 -29 
Combination............. 8.81 .22 

















The whole method proved approximately equal to the com- 
bination method on the criterion of the number of individuals 
who completed learning by either method. By the whole 
method 74 individuals completed learning; by the combina- 
tion method 75 individuals completed learning. 

The whole method proved approximately equal to the 
combination method on the criterion of the number of indi- 
viduals who found one method superior to the other. The 
number of individuals for whom the whole method proved 
superior is 39; the number of individuals for whom the combi- 
nation method proved superior is 37. The number of indi- 
viduals for whom the two methods proved equally difficult is 
(28. 

II. Of 79 individuals who found one method superior to 
the other only 35 or 41 percent found that method superior 
which they habitually use. 

III. When the subjects were divided into groups according 
to sex, there were 57 males and 47 females. Males and fe- 
males learned approximately equally well with the part 
method. 
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CoMPARISON OF SEXES WITH THE Part MetTHopD 














Sex Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
I a ee od ied 10 22 
4 32 
eer 9.6 24 

















The males were superior to the females with the whole method. 


CoMPARISON OF SEXES WITH THE WHOLE METHOD 














Sex Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
ER er ree 8.77 .28 
1.73 43 
be wreaneenens 7.04 33 

















On the criterion of the number of individuals for whom one 
method proved superior to the other, 31 percent of the females 
for whom one method proved superior to the other found the 
part method superior; 18 percent of the males for whom one 
method proved superior to the other found the part method 
superior. 

IV. A group composed of 43 individuals with the average 
chronological age of 26 learned approximately equally as well 


as a group composed of 53 individuals with the average age of 
19 with the part method. 


ComPARISON OF AGE Groups WITH THE Part METHOD 














Av. Age Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
19 9.7 625 
I 32 
26 9.8 .20 

















The younger group proved superior to the older group with 
the whole method, but with an unreliable difference. 


ComMPARISON OF AGE Groups WITH THE WHOLE METHOD 














Av. Age Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
19 7.36 32 
1.26 -47 
26 8.62 
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V. A group composed of 53 individuals with the average 
gross score of 43 on the Otis S-A learned approximately equally 
as well as a group composed of 43 individuals with the average 
gross score of 57 on the Otis S-A with the part method. 


ComparRIsON oF Capacity Groups WITH THE Part METHOD 














Av. Gross Score Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
43 9.85 21 
57 9.72 -29 

















A group with the average gross score of 43 on the Otis S-A 
learned approximately equally as well as a group with the 
average gross score of 57 on the Otis S-A with the whole 
method. 


ComMPARISON oF Capacity GROUPS WITH THE WHOLE METHOD 














Av. Gross Score Av. No. Errors P.E. Av. Diff. in Av. P.E. Diff. 
43 7.89 -33 
25 45 
57 7-64 32 

















INTERPRETATION OF RESULTS 


I. The whole method is superior to the part method even 
where place association is eliminated or negligible. Several 
bits of evidence may be offered to show that place association 
was eliminated or negligible. Of protocols received from 
about one-fifth of the subjects only three indicated a conscious- 
ness of the horizontal or place association. The attention of 
the subjects was apparently directed so firmly to the vertical 
associations between the English words and their nonsense 
syllable associates that they were not conscious of the space 
time relationships of the items of a series which were presented. 

The whole and combination methods proved approxi- 
mately equal by three criteria: the average number of trials 
necessary for complete learning of the material, the number of 
individuals who completed learning by either method; and the 
number of individuals for whom one method proved superior 
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to the other. The slight difference in favor of the whole 
method on the criterion of the average number of trials nec- 
cessary for the complete learning of the material might be 
explained by the fact that the combination method had the 
disadvantage of a larger average number of errors on the 
first test of reproduction, ¢.g. after presentation by the part 
method. 

If place association was neither eliminated nor negligible 
in the whole method, certainly the whole method in which 
the stimulus cards were arranged in the serial order of a sen- 
tence—comprising a meaningful whole—would have proved 
reliably superior to the combination method in which the 
stimulus cards were presented in a non-serial order after the 
first test of reproduction. 

II. Habitual method of learning is not a factor which 
determines the relative efficacy of the whole and part methods 
of learning. However, in the case of experiments in which 
only a few subjects are investigated this factor might be oper- 
ative. This would be an error in sampling. 

The results may have been determined partly by the re- 
liability of each subject’s report as to his habitual method of 
learning. Some subjects probably do not understand their 
own habitual method of learning although they can recognize 
the difference between the whole and part methods when ex- 
plained to them. 

III. A comparison of the data shows that the males are 
both relatively and absolutely superior to the females with the 
whole method. This result is surprising. The writers can 
offer no reason for the superiority except the possibility of an 
error in sampling which the measure of statistical reliability 
does not reveal. 

IV. The younger group proved superior to the older group 
with the whole method, though with an unreliable difference. 
The difference in the age of the groups tested was probably not 
large enough to demonstrate clearly the operation of the fac- 
tor of age. There were a few retarded individuals in the older 
group who may have lowered the average score for the group. 
That is to say, on the basis of evidence furnished by McGeoch, 
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the less intelligent do not succeed so well with the whole 
method as do the more intelligent. 

V. The results of this investigation with regard to I. Q. as 
a factor in the whole-part problem confirm those of McGeoch. 
Though a significant difference is not found between the 
capacity groups with the whole method, the superiority of the 
whole method over the part method for the more intelligent 
group is relatively greater than the superiority of the whole 
method over the part method for the less intelligent group. 

As in the case of the factor of age, the difference between 
the groups is small. Nevertheless a tendency in favor of the 
more intelligent group is shown which might be increased 
with a greater discrepancy in I. Q. 


CONCLUSIONS 


I. Place association is not a factor determining the rela- 
tively greater efficacy of the whole method over the part 
method. This conclusion accords with the observations of 
Crafts and Brown. 

One of the hypotheses which Pechstein tested and rejected 
was disintegration through time. ‘Though he did not find this 
factor operative in maze learning, it might be found to operate 
in verbal learning. For example, in the present investigation 
the number of syllables which were omitted by the subjects 
on the first test of reproduction was greater in the part method 
than in the whole method. 

II. Habitual method of learning is not a determining 
factor. 

III. Age is not a determining factor where age difference 
is not relatively great. 

IV. Sex is a factor determining the relative greater efficacy 
of the whole method over the part method. 

V. Capacity is not a determining factor where the groups 
compared do not differ significantly in gross test score. The 
result of this investigation confirms McGeoch’s results with 
respect to the direction of the tendency of the superior group. 

Individual differences as a determining factor is not ap- 
parent. Only one-fifth of the subjects learned better by the 
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part method. The subsumption of the difference in efficacy of 
the whole and part methods of learning under individual 
differences is questionable. 

The relative efficacy of the whole and part methods of 
learning is not a resultant of place association or of the vari- 
ables tested in this investigation, with the possible exception 
of sex. Positive and dogmatic statements about the deter- 
minants of the relative efficacy of the methods cannot as yet 
be made. 

One possible explanation not yet tested is the operation of 
Jost’s law which states: “‘Of two associations which are of 
equal strength but of different ages, the older receives the 
greater intensification from a new repetition.” Since the 
time which lapses between the repetition of each item is longer 
in the whole method than in the part method—it would seem 
—the strength of association of each item learned by the whole 
method is more intensified by repetition than the strength of 
association of each item learned by the part method, if Jost’s 
law holds for such comparatively short intervals of time. 


(Manuscript received May 28, 1932) 
4 Meumann, E., The Psychology of Learning, D. Appleton (1913), p. 266. . 











MEMORY FOR PLEASANT, UNPLEASANT, AND 
INDIFFERENT PAIRS OF WORDS 


BY HULSEY CASON ann FRANCES CLARA LUNGREN 


University of Wisconsin 


In two previous experiments,! Cason found that pleasant, 
unpleasant, and indifferent pairs of words could be learned 
and retained with approximately equal efficiency. The 
earlier studies had been few in number (1, 3, 4, 5, 6), and the 
somewhat scattered results had not been conclusive. In the 
present experiment the problem has been carried a step further 
by comparing the relative efficiency of learning and retaining 
pleasant, unpleasant, and indifferent pairs of words arranged 
in various combinations. Inthe previous experiments (2), the 
two words in a single pair were P, U, or I; and in the present 
investigation the pairs were made up of words that were P-U, 
U-P, P-I, I-P, U-I, and I-U to the subject. The 468 words 
used in the present study were the same as those used in the 
previous experiments.? All of these words were among the 
3,000 most commonly used words in the English language. 
The P, U, and I words that were used were approximately 
equal in length, and the three kinds of words included 
practically the same per cent. of nouns, verbs, and adjectives. 


Procedure.—Each subject was taken individually, and the second-named author 
was the experimenter for each of the 50 subjects. 

Each of the 468 words had been printed on a stiff 4- by 14-inch card, and all of 
the words were mixed together. 

1. The S was given the collection of words; and he picked up one card at a time, 
looked at the word for a moment, and classified it in one of three boxes as P, U, or I, 
if the word itself was P, U, or I to him, or if the immediate associations were P, U, or I. 
The S continued to classify the words as P, U, or I until the required number had 
been obtained of each kind. 

2. Using the words that had been classified by the S, the E arranged three sets 
of material, and each set consisted of 24 pairs of words. Each set included four of 


each of the following kinds of pairs: P-U, U-P, P-I, I-P, U-I, and I-U. The pairs 





132, Experiments II and III. 
2 The complete list of words used is given in 2, Table 12. 
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of words were rotated in each set so that no one of the six different kinds of pairs 
would have any advantage on account of position. Each of the three sets of material, 
of 24 pairs each, was arranged on a stiff cardboard, and a key showing which pairs 
were P-U, U-P, P-I, I-P, U-I, and I-U to the S could be folded out at the side of 
the cardboard or it could be folded out of sight. 

3. The material in a single set was presented orally two times to the S, and the 
pairs followed each other in an irregular order at intervals of five seconds. S was 
instructed to distribute his study and effort equally between all of the pairs in a single 
set of material, and to try to connect the words in each pair in the forward direction 
only during the time interval intended for that pair. Later when the first word in 
a pair was spoken by the E the S was to try to give the second. A special effort was 
made to have the S distribute his study and effort equally between the six different 
kinds of pairs. The conditions seemed to be as equally favorable for the six different 
kinds of pairs as they could be made. 

4. E then tested S’s learning orally, and the pairs were again taken in an irregular 
order. When S could not give the second word in a pair, E prompted him by repeating 
both of the words, and on this occasion S was given an additional five seconds to try 
to connect the words in the pair. S was instructed to try to learn all of the words 
that had been missed. 

5. Procedures 3 and 4 above were followed without delay for the second and 
third sets of material, but in some cases the lists were presented once or three times, 
and the interval between E’s reading of the successive pairs was occasionally seven 
and a half or ten seconds in length. These variations adapted the procedure to the 
different abilities of the different S’s, and in the second and third sets of material it 
decreased the number of S’s who learned an unusually small or an unusually large 
number of pairs out of a possible 24. Changing the time intervals for different S’s 
gave whatever differences there were in the efficiency of learning the six different kinds 
of pairs a better chance of appearing, but it did not prejudice the results in favor of 
any of the different kinds of pairs. 

6. A copy was made of the three sets of material; and S’s retention of the pairs 
was again tested in the same way after an interval of one, two, or three days. 

A number of precautions were taken so that during the oral presentation of the 
words, during the test for Learning, and during the test for Retention, the E herself 
would not know which words had been judged P, U, and I by the S._ Every possible 
control was used so that the conditions would be equally favorable for learning and 
retaining the six different kinds of pairs. 

Subjects—A total of 50 S’s was used, and most of them were undergraduate 
university students. Their sex and age distribution was as follows: 





Age Men Women 
Re chbsndbesedwebewe 2 2 
RS cieaidah akan nied 2 I 
ee 8 18 
I5-I9.. 4 13 
PR coos see cbcacancwns 16 34 


Results.—Table I shows the relative ranks for different S’s 
in the tests for Learning and for Retention for the six different 
kinds of pairs. The figure in the upper left corner of the table 
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shows that 10 S’s out of a total of 50 learned the P-U pairs 
more efficiently than the five other kinds of pairs. The rela- 
tive rank in learning the P-U pairs was a tie between fourth 
and fifth places (a rank of 43) for 11 S’s outof 50. The aver- 
age rank in the efficiency of learning and retaining the six 
different kinds of pairs is shown in next to the last column of 
the table. The combined resulting order of rank is given in 
the last column. The outstanding feature of the results in 
Table 1 is the small and unreliable differences in the ranks of 
the six different kinds of pairs. The large amount of over- 
lapping is much more significant than the small differences 
between some of the average scores. 


TABLE I 


RELATIVE RANKS FOR DIFFERENT SUBJECTS IN LEARNING AND RETAINING 
P-U, U-P, P-I, I-P, U-I, anp I-U Parrs or Worps 




















Rank r | 31] 2] 2413 13414 14415 | 54 | 6 | Av. | Rank 
Rank| Order 

P-Uj1o] 3/1] 2] 8} 4] 1/11} 315 | 213-35 I 

U-P|} 6) 2}3]5 {101 41/3] 314]5 | 513-49] 4 
Learning P-I | 4] 2]/4]/5]5|6/3] 515141] 713-74] 6 
N = 50 I-P | 4441317] 6| 812] 37114] 7/3-42] 3 
U-T}| 5} 21/3/61] 319]4] 31/4161] 513.64] 5 

IHU} 7/5 }5/3]6)/ 5/31] 3} 2] 24] 913-36] 2 

P-U| 61 4]2]4115|2/31]414]2] 413-26] 2 

Retention after|U-P| 4} 4]5]1|f1r}o{]9] 51713 1 113-43) 3 
one, two, or|P-I | 6/4]21]51] 51119] 616] 31 313-491 4 
three days I-P j1o} 515131413121 5/5]6] 2]3.16| 1 
N = 50 U-I |} 4]}0]/6]5]5)41]6] 2}6]5 | 713.80] 5 
IU] 613/612] 510]/3] 41315 |13|3-86|] 6 















































The combined average results of all of the S’s are shown 
in Table II. Inthe test for Learning the S’s learned an aver- 
age of 2.33 + .o7 P-U pairs of words out of a possible four; 
and in the test for Retention they retained an average of 1.41 
+ .06 P-U pairs of words out of a possible four. In the test 
for Learning and in the test for Retention, the average scores 
for the six different kinds of pairs of words are approximately 
the same. The differences between the averages are unrelia- 
ble; and the most significant feature of the results is the large 
amount of overlapping in the individual scores. The results 
do not favor the superior learning or retention of any one of the 
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six different kinds of pairs. The most plausible conclusion 
seems to be that there is little, if any, difference in the relative 
efficiency of learning and retaining P-U, U-P, P-I, I-P, U-I, 
and IJ-U pairs of words. 


(Manuscript received July 14, 1931) 
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EQUAL WEIGHTS AND PSYCHOPHYSICAL 
JUDGMENTS 


BY LEON ARONS anv FRANCIS W. IRWIN 


University of Pennsyloania 


INTRODUCTION 

In the method of constant stimuli, judgments are made by 
an S on pairs of stimuli, each pair consisting of a standard and 
a comparison stimulus. The standard stimulus remains 
constant throughout the experiment, but the comparison stim- 
uli form a graded series extending from a stimulus value less 
than, to a stimulus value greater than the standard, usually 
including one stimulus equal to the standard. The present 
investigation may be considered a control experiment in which, 
in contrast to the usual procedure, the standard and compari- 
son stimuli remain objectively equal. As far as the authors 
are aware, such an experiment has never before been per- 
formed. 


METHOD 


The apparatus and method of lifting were as described by Fernberger.!. Twenty- 
eight cylindrical brass weights of 100 g. each were used. They were checked both at 
the beginning and at the end of the experiment. All weights varied less than 20 mg. 
from 100 g. A revolving table and a chair with arm rest and screen were employed. 
The rhythm of lifting was maintained by a metronome set at 92, without bell signal. 

The S was seated at the revolving table. The arm rest and screen were adjusted. 
Instructions were given as follows: 

“You will be presented with a series of weights in pairs. You are to judge the 
second weight of each pair as heavier than, equal to, or lighter than the first, using 
the terms ‘heavier,’ ‘equal,’ or ‘lighter’ in your report. If you are not certain of 
any judgment, guess.” 

These instructions were read to the S at the beginning of each experimental 
session. Seven Ss served in the experiment: Dr. S. W. Fernberger (F), Dr. H. S. 
Oberly (Oy), Dr. C. L. Altmaier, Jr. (A), Mr. T. J. Snee (Sn), Mr. W. N. Hallett (H), 
Mr. W. Carlson (C), and Mr. S. Tompkins (T).2, With the exception of T, a graduate 





1S. W. Fernberger: On the relation of the methods of just perceptible differences 
and constant stimuli, Psychol. Monog., 1913, 14, 4 (Whole No. 61), 6-11. 

2 The authors wish to express their indebtedness to their Ss, and particularly to 
Professor Fernberger for his advice and encouragement. 
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student, all were members of the Department of Psychology. Of the Ss, F and Oy 
were very experienced in weight lifting; H and T moderately experienced; while C, S, 
and A were totally inexperienced, having never before participated in a weight-lifting 
experiment. The number of judgments obtained from each S was as follows: F—soo, 
Oy—soo, A—soo, Sn—soo, H—2500, C—1000, T—s500. Two hundred and fifty 
judgments were obtained in each experimental sitting, except in the case of F who 
gave 200 and 300 in two sittings respectively. F, and H (during his first six sittings), 
the first two Ss, were allowed to pause for rest whenever they felt at all fatigued. 
Rest periods were then standardized, one being introduced after each series of 50 
judgments. The number of rest periods taken by F corresponds closely to the standard 
number later allowed. H took fewer than the standard number in his first six sittings. 

The lifting technique was demonstrated to the naive Ss (C, S, A) at the outset. 
They were allowed to practice five times using the same weight. The regular instruc- 
tions were then given and the experiment was begun. No preliminary trials were 
given to the practiced Ss. The Ss were not informed as to the special conditions or 
the purpose of the investigation. 

The experiment was carried on at the Laboratory of Psychology at the University 
of Pennsylvania during the spring and fall of 1931. 


RESULTS 


The frequencies of judgments given by each S in each 
category will be found in Tables I and II. The results for 
individual sittings are given separately for each S. The 250 
judgments in an individual sitting will be found fractionated 
into five successive series of 50 judgments. ‘The results of F 
are irregular in that his two sittings consisted of four and six 
series of 50, respectively, and have been so tabulated. The 
columns headed L, E, and H contain the number of judgments 
of ‘lighter,’ ‘equal,’ and ‘heavier’ for each series of 50. In 
the same columns opposite P are the probability values of the 
different categories for each sitting as a whole, determined by 
the sum of the judgments in each category divided by the 
total number of judgments in all three categories (250 except 
in the case of F). The mean variations of the frequencies 
have been calculated from all of the judgments given by each 
S, on the basis of the frequencies in series of 50 judgments. 


DIscuSSsION 


In the discussion of the results we shall concern ourselves 
with the following: (1) general considerations, (2) fatigue 


effects, (3) the effect of previous comparisons, and (4) over- 
estimation. 
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TABLE I 
FREQUENCIES OF JUDGMENTS 


Subject: H (2500 j’s) 











Sitting L E H Sitting L E H 
17 5 28 18 13 19 
8 14 28 17 19 14 
I 15 10 25 6 15 19 16 
II 12 27 12 12 26 
13 13 24 16 19 1S 
P 256 216 528 r 312 328 360 
16 15 19 16 1S 19 
12 16 22 20 12 18 
2 12 16 22 7 12 18 20 
12 8 30 8 2 19 
16 13 21 II 17 22 
P 272 272 456 P 268 340 392 
20 12 18 IS 14 21 
9 15 26 12 20 18 

3 18 12 20 8 1S 12 2 
13 18 19 12 IS 23 
14 12 24 12 15 23 
P .296 .276 .428 P 264 304 432 
6 29 15 13 18 19 
II 21 18 II 20 19 
4 8 23 19 9 13 10 27 
II 24 IS 19 13 18 
8 27 IS IS 10 25 
P .176 .496 328 P .284 .284 .432 
7 26 17 19 18 13 
13 16 21 14 19 17 
5 9 16 25 10 10 13 27 
10 18 22 19 5 26 
13 19 18 II 10 29 
P .208 .380 412 P .292 .260 .448 
Total freq. 657 789 1054 
Mean var. 2.84 4.01 3.61 
P for total .263 316 422 


























1. General Considerations 


An examination of the probabilities for the various judg- 
ment categories (Tables I and II) reveals great individual 
differences, despite the fact that all of the weights were 
objectively equal. This demonstrates clearly that objective 
differences in stimuli are by no means necessary for difference 
judgments. In no case is the probability value of ‘equals’ 
(for the totals) greater than that of the other two categories, 
although in the case of A the values for ‘equals’ and ‘heaviers’ 
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are the same. In the case of individual sittings there are but 
three instances in which the probability value for ‘equals’ is 
the greatest: H’s 4th, C’s 2d, and A’s Ist. The probabilities 
for individual sittings vary from .024 to .370 for ‘lighters,’ 
.132 to .496 for ‘equals,’ and .328 to .592 for ‘heaviers.’ 
Most of this variation is from S to S, the variation for individ- 
ual Ss being relatively small as is readily seen from the mean 
variations. The relative amount of variability of the three 
categories of judgment for each S is given in Table III. 











TABLE III 
VARIABILITY OF JUDGMENT FREQUENCIES IN THE THREE CATEGORIES 
Ss L E H 

PP eerer TT Tere 17.0 30.0 5.4 
AE eee ee 10.7 8.6 10.2 
tte eka aad cme 12.3 9.3 8.1 
EERE TP Te 10.2 9.5 9.5 
, er Te 7.1 14.7 11.2 
errr ee 21.7 25.4 17.1 
er ery 32.5 16.3 6.0 














These values have been computed by the formula: Variability 
= MV X 100/Mean. In general, the category ‘heavier’ 
shows the least variability, and ‘lighter’ shows the greatest. 
For 4 Ss the category ‘heavier’ is the least variable and for 
none is it the most. Likewise, for 4 Ss ‘lighter’ is the most 
variable and for one S, the least. It seems significant that 
this one S (T) gave by far the smallest proportion of lighter 
judgments. 

‘Sophistication’ in weight-lifting seems to have had no 
well-defined effect on the relative frequencies and the con- 
stancy of the results, as may be seen from the comparison of 
the results of F and Oy (the two most sophisticated Ss) with 
one another and with the other Ss. 

There are no apparent trends attributable to the effect of 
practice. ‘These trends might be expected to show themselves 
particularly with Ss H and C who gave the most extended 
series of judgments. Not only is this not the case, but the 
size of the mean variations indicates that such an influence, 
if present, was minimal. More extended experimentation 
using more Ss may, however, reveal effects due to practice. 
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As stated above, no S was given information with regard to 
the nature of the experiment. At the end of his second sitting, 
Oy, our most sophisticated 5, stated his belief that all the 
weights were equal with the exception of two or three which 
were definitely heavier or lighter than the others. To F, 
there seemed to be more variability in the standard than in 
the comparison weights; at one time he declared that there 
was “‘no question as to the first weights differing.” The other 
Ss seemed to take the experiment as a matter of course. 
From time to time, various of the Ss spontaneously remarked 
‘much heavier’ or ‘much lighter.’ Thus it appears that 
many of the difference judgments were given with a high 
degree of subjective assurance. 


2. Fatigue 


In order to analyze the results for the possible influence of 
fatigue, they have been arranged as in Table IV. The data 
are given for each S separately and for the total. The results 
of F have been omitted from this table, since his sittings con- 
sisted of 200 and 300 judgments respectively and are, for that 
reason, not comparable with the others. The numbers I to 
5 in the first column indicate the five divisions (series) of a 
sitting, in temporal order. The corresponding rows give the 
total number of judgments in each category for each of these 
five series. Thus, totaling the results of H for the first series 
in all of his ten sittings, we find that he gave 147 ‘lighter,’ 165 
‘equal,’ and 188 ‘heavier’ judgments. 

From an inspection of this table it is evident that while 
there are variations, there are no definite trends. This is 
true both for the individuals and for the totals. Since fatigue 
might be expected to manifest itself in some definite tendency 
relative to the five temporal divisions in the sitting, we con- 
clude that its effect, if present, was negligible. 


3. The Effect of Previous Comparisons 
In dealing with the results, the questions arise: to what 
extent is the comparison-judgment made upon a given stim- 
ulus pair dependent merely upon the activity involved in 
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making that comparison? On the other hand, to what extent 
does it depend upon preceding comparisons in the series? 3 
This analysis is particularly interesting since, with the present 
method, stimulus differences are not among the variables 
involved in determining the judgment, and hence do not 
require mathematical elimination. Conceivably, any com- 
parison may be influenced by any previous comparison or 
combination of previous comparisons. While our method is 
adequate to study any of these influences, we have chosen the 
simplest, and presumably the most important, namely, the 
influence of the immediately preceding comparison. 

The problem reduces to the question as to whether succes- 
sive judgments in the various categories are distributed on a 
chance basis. ‘Thus, for example, ‘heavier’ judgments are 
immediately followed sometimes by ‘heaviers,’ at other times 
by ‘lighters’ or ‘equals.’ Will the distribution of such sub- 
sequent judgments correspond with an estimation made on 
the basis of the observed relative frequencies for the respective 
categories? 

With the three judgment categories, there are nine possible 
sequence-pairs: ‘lighter-lighter,’ ‘lighter-equal,’ ‘lighter-heav- 
ier, ‘equal-lighter,’ ‘equal-equal,’ ‘equal-heavier,’ ‘heavier- 
lighter,’ ‘heavier-equal,’ and ‘heavier-heavier.’ If, now, we 
consider the relative frequency of judgments in a given cate- 
gory as establishing an empirical probability of occurrence of 
judgments in that category, the probability of occurrence of 
any sequence-pair will be given by the product of the two 


3 For a general discussion of inter- and intra-serial effects, and numerous references, 
see W. D. Turner, Intra-serial effects with lifted weights, Amer. J. Psychol., 1931, 
43, 1-25. These studies are essentially investigations of the effect of stimulus differences 
upon subsequent judgments. The effects, if any, of the comparisons themselves are 
not subjected to analysis. Turner’s conclusion that the effects he found were ‘not 
due to the nature of the previous judgments themselves’ was warranted by the evidence 
that the effects increased with an increase in the stimulus differences. However, 
some effect of previous comparisons may well have been present in the form of a 
constant factor which his procedure was incapable of analyzing. 

It may be noted that Martin and Miller, Zur Analyse der Unterschtedsempfindlich- 
keit, Leipzig, 1899, pp. 155-167, have employed a procedure similar to ours to the 
extent of introducing 5 comparison weights equal to the standard (Vexirversuche), 
in a series of 8 in order to investigate the influence of the comparison weight of 
one pair upon the judgment on the comparison weight of a succeeding pair. 
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relative frequencies concerned. For example, if the relative 
frequency for ‘heaviers’ for a given S is .50, and for ‘lighters,’ 
.30, we should expect 15 percent of all the sequence-pairs to be 
‘heavier-lighter,’ and 15 percent ‘lighter-heavier.’ 

In Tables V-IX will be found the results obtained by this 
analysis for each sitting of each S, and the results taken as a 
whole for individual Ss (under ‘Total’). In the first column 
are listed the 9 possible judgment sequences, LL, for example, 
representing a ‘lighter’ followed by a ‘lighter.’ In the third 
column will be found the number of these sequences observed, 
as obtained by actual count from the data sheets. Pairs 
split by a rest period or by any other interruption have not 
been used in these computations. ‘The figures in this column 
opposite ‘Total’ in the first column give the number of ‘light- 
ers,’ ‘equals,’ and ‘heaviers’ involved in these pairs. The 
second column (‘Calc.’) gives the number of pairs of each 
kind expected on the basis of chance. ‘These values have been 
obtained by multiplying the total number of pairs in each sit- 
ting by the various sequence-pair probabilities calculated as 
described above. In the fourth column will be found the 
standard errors (¢) of the calculated values computed from the 
formula Gtreq, = YpgN. The fifth column gives the differ- 
ences between the observed and calculated values. Minus 
signs indicate that the number of cases observed was less than 
the number calculated. The slight discrepancies between the 
total numbers of pairs (given by ‘Total L, E, H’) for the cal- 
culated and observed values are to be accounted for in terms 
of the number of decimal places used in the computations. 
The sixth column gives the ratios of the differences to the 
standard errors, the signs of the differences being preserved. 
These ratios for the totals of each S have been collected in 
Table X. 

It seems obvious from the magnitude of many of the criti- 
cal ratios in the tables that judgments are not completely 
independent of the preceding comparisons in the series. This 
seems to hold for all subjects except T for whom these values 
are consistently low. ‘There are individual differences in 
both the direction and the magnitude of the effect as dis- 








TABLE V 


ComMPARISON OF OBSERVED AND CALCULATED FREQUENCIES OF THE 
Various SEQUENCE-PAIRS 


Subject: H 














Calc. 





16.3 
13.2 
33-7 


13.2 
10.7 
27.1 


33-7 
27.1 


69.3 
244.3 
































Calc. 


D/e 





21.6 
20.4 
31.0 


20.4 


19.2 
29.0 


31.0 
29.0 
44.0 


245.6 





























Sitting 
, | Seer 
EL. 
tes a eee eee 
EPs 
HL. 
Seer 
eee oe 
er 
: |= = apeverr 
Sitting 
+ am 
LE.. 
LH... 
|: eee 
EL.. 
EE.. 
nee ere 
| 
HL. 
re eee 
eee 
ere 
, . = % Sere 
Sitting 





Calc. 





| EN eres 





11.1 
19.8 
21.5 


19.8 
35.1 
38.5 


21.5 


38.5 
42.2 


248.0 
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TABLE V (Concluded) 
Subject: H (Concluded) 
















































































VII VIII 
Sitting 
Calc.} Obs.| ¢ D D/e | Calc.| Obs.| D D/e 
Es : fF = =—hch 84] 15.7} 19] 3.83] 3.3 86 
LE. 2... ee cee ee ee ee eee] 223] 19] 4.50]/— 3.3]— -73] 19.1] 19] 4.20/— O.1 02 
LH. .. 2.0.2.0 ceeeseees] 25.0] 2514-74] . 0.0] .00} 27.2] 2414.92/— 3.21\— .65 
ere er 64 62 
EL 22.3] 20] 4.50/— 2.3|— .51| 19.1] 19] 4.20/— O.1|/— .02 
EE 29.4] 29|5.09)— 0.4/— .08] 23.5) 19] 4.61|/— 4.5|— .98 
eer 33-31 3615.37] 2.7 50] 33-1] 3815.35) 4.9 92 
| eee 85 76 
HL 25.0] 25|4.74| 0.0 00] 27.2] 271 4.92|— 0.2/— .04 
ere eee rer 33-3] 3515-37] 1-7 32| 33-11 3615.35} 2.9] .54 
ee er 37-7| 36] 5.65|— 1.7|/— .30] 46.8] 44]6.15|— 2.8/— .46 
, | errr 96 107 
pS Serre 245.0] 245 244.8] 245 
IX x 
Sitting 
Calc.} Obs.| D D/e | Calc.| Obs.) o D Die 
LL 20.1] 14] 4.29]/— 6.1|—1.42] 20.1] 21] 4.29] 09 21 
Ee eerie 20.1] 21}4.29] 0.9]  .21] 18.4] 14] 4.12|— 4.4|/—1.07 
ae 30.1] 35|5-14] 4.9] .95| 31-9] 35]|5.26) 3.1 59 
I ee eee eae 70 70 
EL 20.1] 25|4.29] 4.9] 1.14] 18.4] 18] 4.12/— 0.4/— .10 
EE 20.1} 22] 4.29 1.9| .44| 16.7] 181] 3.94 1.3 33 
PMictihsisieeesek aad 30.1] 23] 5.14/— 7.1|—1.38] 28.9] 28|5.05|— o.g/— .18 
| eee 70 64 
i his einbenad meee 30.1] 3115.14] 09 18] 31.9] 32|5.26) o.1 02 
ey rr 30.1} 2615.14/— 4.1|— .80} 28.9] 31] 5.05] 2.1 42 
Panne 45-1] 48|6.06} 2.9] .49) 50.2] 48|6.32/— 2.2/— .35 
eee 105 III 
WOUND Big Me The nc ce seses 245.9] 245 245-4) 245 
Total 
Sitting 
Calc Obs. o D Die 

eee 168.4 175 12.51 6.6 53 
Peery 203.3 172 13.65 — 31.3 — 2.29 
nc ceecie Keenan 271.6 296 15.54 24.4 1.57 
OO ee ee 643 
Ai ee seis a etal diitin 203.3 197 13.61 — 63 — .46 
Eee 245.6 277 14.87 31.4 2.11 
ere err 328.1 303 16.86 —25.1 —1.49 
is Chnkeh ee wanid 777 
HL 271.6 275 15.54 3-4 .22 
ee 328.1 325 16.86 — 3.1 — .18 
ere 438.3 438 18.98 — 03 — .02 
, RE 1038 
, | ee 2458.3 2458 


























TABLE VI 





CoMPARISON OF OBSERVED AND CALCULATED FREQUENCIES OF THE 
Various SEQUENCE-PAIRS 


Subject: C 






















































































I II 
Sitting 
Calc.| Obs.| o D D/e | Calc.| Obs.| o@ D D/e 
SS 17.6} 16] 4.05|— 1.6J— .40| 15.7} 18]3.83] 2.3]  .60 
| 22.5} 2314.52} 0.5] .11| 23.3] 2814.59] 4.7] 1.02 
LH... 25.5| 27] 4.78 1.5} .31] 23.0] 1614.57|— 7.0/—1.53 
Total I 66 62 
EL.. 22.5] 2714.52] 4.5] 1.00] 23.3] 26/4.59] 2.7] .59 
EE.. 28.9] 24]5.05|— 4.9/— .97] 34.5] 22|5.45|/—12.5|—2.29 
rer errr er 32.6] 3315.31 -4{ .08) 34.1] 4415.42] 9.9] 1.83 
Total I 84 g2 
HL. 25.5] 2214.78)— 3.5|— .73] 23.0} 20] 4.57/— 3.0/— .66 
DS a beara ake e ened 32.6] 3815.31 5-4] 1.02] 34.1] 41|5.42| 6.9] 1.27 
HH.. 37-0] 35|5.60/— 2.0/— .36| 33.8] 30]5.401— 3.8/— .70 
tk enc ane ee 95 gI 
WOU Big Big BE a cc cceans 244.7| 245 245.7| 245 
Ul IV 
Sitting 
Calc.| Obs.| o@ D D/o | Calc.} Obs.| o D Die 
LL.. 16.7| 12|3.94|— 4.7/—1.19] 14.9] I1|3.74/— 3.9/—1.04 
LE... 20.8} 16|4.37|\— 4.8/—1.10] 20.3] 20] 4.31|/— 0.3/— .07 
26.5| 3614.86] 9.5] 1.95] 25.11 2914.75] 3.9] .82 
_ | er 64 60 
EL.. 20.8} 2814.37] 7.2! 1.65} 20.3] 25/4.31] 4.7] 1.09 
a. 26.2} 19] 4.84|— 7.2/—1.49] 27.6] 24] 4.95|— 3.6/— .73 
rer errr re 33-11 33|5-35|— O.1|— .02] 34.2] 3315.51/— 1.2]/— .22 
Total E 80 82 
HL.. 26.5} 25|4.86/— 1.5|— .31] 25.1] 22] 4.75|— 3-1]— .65 
PP etecugkenesien ned 33-1] 4315-35] 9-9] 1.85] 34.2] 4115.51] 6.8] 1.23 
er 41.7] 33|5.88|— 8.7|—1.48] 42.7] 3915.94/— 3-7/— .62 
eee 101 102 
ye rrr 245.4] 245 244.4| 244 
Total 
Sitting 
Calc. Obs. o D D/e 

| ee eee 64.6 57 7.77 — 7.6 — .98 
LE.. 87.1 87 8.91 — oO! — .OI 
Ee ee rer ree 99.9 108 9.47 8.1 86 
seb eennsadnnnds 252 
Ee ree 87.1 106 8.91 18.9 2.12 
EEE Teer re 116.5 89 10.13 — 27.5 —2.71 
rere 134.1 143 10.8 8.9 82 
0) eee 338 
HL. 99.9 89 9.47 — 10.9 —1.15 
rer err ere 134.1 163 10.8 28.9 2.68 
eer se 154.7 137 11.4 —17.7 —1.55 
0 Perr rreS 389 
We Gs En Es basse ones 978.0 979 
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TABLE VII 
CoMPARISON OF OBSERVED AND CALCULATED FREQUENCIES OF THE 
Various SEQUENCE-PAIRS 





















































Subject: F 
I II 
Sitting 
Calc.| Obs.| @ D D/o | Calc.; Obs.| o D D/e 
LL... 1.2... eee ee ee | 27-3] 2314.85|— 4.3]— -89) 39.7] 3415.86)— 5.7/— .97 
LE... ..........622206-] 19.1] 2614.15] 6.9} 1.66) 26.5) 251 4.91/— 1.5|— .31 
LH... .............2206-] 26.5] 2414.79|— 2.51— .52] 41-7] 4915.98] 7.3) 1.22 
DRL Kadeeseede sans 73 108 
EL 19.1] 19] 4.15|— O.1|— .02] 26.5] 30] 4.91 3.5 71 
EE 13-5| 9|3-541— 4.5|—1.27| 17.6] 12] 4.07|— 5.6|—1.38 
re ree 18.5] 23|4.09] 4.5] 1.10} 27.9] 30|5.03} 2.1 42 
Total E.. 51 72 
HL .| 26.5] 2914-79) 2.5 52] 41.7] 4115.98|— 0.7/— .12 
Rats a etdiemcdstanel 18.5] 17] 4.09/— 1.5|— 27.9] 36] 5.03 8.1) 1.61 
ee .| 25.7] 25|4-73|— 0.7/— .15| 44.4] 37|6.14/— 7.4]/—1.21 
: ree 71 114 
a SS rere 194.7! 195 293.9] 294 
Total 
Sitting 
Calc Obs. o D Die 

fee 67.0 57 7.60 —10.0 —1.32 
7 ashi ia hea. sel Mintek ekedihileed 45:5 51 = 5.5 85 

etd Sah aca a aardk ies acini " 73 7.67 4.5 59 
- Eee 181 
EL 5.5 49 6.46 3.5 54 
EE.. 31.3 21 5-41 —10.3 — 1.90 
err See 6.5 53 6.48 6.5 1.00 
, | are 123 
HL 68.5 70 7.67 1.5 20 
AA id wide wasewaliain 46.5 53 6.48 6.5 1.00 
reer ee 69.9 62 7.74 — 7.9 —1.02 
0 ree 185 
a. . ' & ere 489.2 489 
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TABLE VII (Concluded) 
Subject: A 


LEON ARONS AND FRANCIS W. IRWIN 


















































I II 
Sitting 

Calc.| Obs.| ¢ D D/a | Calc.| Obs.| o D Die 
hk bikie thn Abad kates 11.5} 9|3.31/— 2.5|— .76| 10.5] 6)3.18|/— 4.5|—1.42 
LE... 1. ee ee ee ee ee eee] 20-1] 2414.39] 2.9] 66} 19.8] 19] 4.27/— 0.8/— .19 
LH... 2... 6. eee ee ee eee] 20.3] 201 4.32/— 0.31— .07] 20.6] 2614.37] 5.4] 1.24 
ree 53 51 
PRiGtes 400s soceeees cee i Skee 1.9} .43| 19.8] 2114.27 1.2} .28 
EE... 2... ee eee ee ee ees] 39-2] 3515-741— 4-2/— .73] 37-0] 34|5.60)— 3.01— .54 
ERs ccccccccsecesececl SR @OIGEGL 23) 6890.6 20184 «28.s, (8 
, 5 eee 98 95 
HL.. 20.3] 20|4.32)— 0.3|— .07| 20.6] 2414.37] 3.4] .78 
ee ere 37-7| 4215.65} 4.3] .76] 38.5] 42]5.69] 3.5] .62 
ee ree 36.0] 3215.54/— 4.0|— .72] 39.9] 33|5.78|— 6.9|/—1.19 
rere 94 99 
DEE Be Be Bs 6 va vcsen 244.9] 245 245.2] 245 

Total 
Sitting 
Calc. Obs. o D Die 

LL.. 22.1 15 4:59 — 7.1 —1.55 
LE.. 41.2 43 6.14 1.8 29 
LH. . 41.2 46 6.14 4.8 .78 
Total L 104 
EL. 41.2 44 6.14 2.8 .46 
ti6¢6<ascetvenueneee 76.0 68 8.01 — 8.0 — 1.00 
Serer rere 76.0 81 8.01 5.0 62 
fer 193 
HL. 41.2 44 6.14 2.8 .46 
eo 76.0 85 8.01 9.0 1.12 
Serr rere 76.0 64 8.01 — 12.0 —1.50 
re 193 
, err 490.9 490 
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TABLE VIII 


CoMPARISON OF OBSERVED AND CALCULATED FREQUENCIES OF THE 


Various SEQUENCE-PAIRS 


















































Subject: Oy 
I II 
Sitting 

Calc.| Obs.|  o¢ D D/e | Calc.| Obs.| o D Die 
LL 10.3] 4]3.14/— 6.3]—2.01] 3.4] 1/|1.84)— 2.4/—1.30 
Serer 10.8} §/3.21/— §.8|/—1.81] 8.3] 4)2.84|— 4.3/—1.51 
| 28.9] 41]5.05| 12.1] 2.40] 17.2] 2413.99] 6.8] 1.70 
Total L 50 29 
EL .| 10.8) 14] 3 2} 1.00] 8.3] 11]2.84) 2.7] .95 
EE | INS] 12]3.31] 0.5] .15| 20.6] 7] 4.34/—13.6|—3.13 
eee 30.6] 27|5.18|— 3.6|— .69] 42.1] 531|5.87| 10.9] 1.86 
: |, 53 71 
HL .| 28.9] 3315.05] 4.1 81} 17.2] 17] 3.99/— 0.2;/— .05 
HE .| 30.6] 37|5.18] 6.4] 1.24] 42.1] 62]5.87) 19.9] 3.39 
ch skal aii wal ad aa 82.3] 72] 7.39|—10.3|—1.39] 85.8] 66] 7.47|—19.8] —2.65 
0 es 142 145 
. | err 244.7| 245 245.0] 245 

Total 
Sitting 
Calc Obs. o D Di/e 

Ee ee ee 12.7 5 2.40 — 7.7 —3.21 
Seer Te 20.1 9 4.01 —I0I —2.77 
errr ree 46.1 65 5.65 18.9 3-35 
|, Pere rer 79 
cticeeesiabeademiee 20.1 25 4.01 4.9 1.22 
SS rere ere 31.4 19 6.14 —12.4 — 2.02 
Peer ee 72.5 80 8.35 7.5 .go 
ee 124 
Se err ee ee 46.1 50 5.65 3.9 69 
Di citréeteneneneewes 72.5 99 8.35 26.5 3.17 
errr 168.1 138 10.50 —30.1 — 2.87 
REESE Sr 287 
WOES Eee Mig BBs ic bccccces 489.6 490 
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TABLE VIII (Concluded) 


LEON ARONS AND FRANCIS W. IRWIN 


















































Subject: T 
I II 
Sitting 

Calc.| Obs.|  ¢ D Di/e | Calc.| Obs.) o D Die 
eee 0.2} 110.49] 0.8] 1.62} 0.0] O]0.00; OOF — 
eer rr 3-71 3|1.90/— 0.7|— .37] 2.2] 1] 1.48/— 1.2)— .81 
LH... 2.0... 2. ee ee ee eee] 2] 4 | 2.03]— 0.2]/— .10] 2.5) 4] 1.56] 1.5] .96 
CL, k4on cw dneeens 8 5 
EL.. 3-7| 311-90/\— 0.7|/— .37] 2.2] 4]1.48] 1.8] 1.22 
EE.. 47-51 49|6.19} 1.5] .24] 51.2} 50/6.38/— 1.2/— .19 
errr eee 56.8] 56|6.61|— 0.8/— .12] 58.6] 58|6.68|— 0.6/— .09 
THE Ws oe vsccces 108 112 
HL. ..............-2020] G2] 4:12.03] — O.2]— .10] 2.5) 2)1.56)— O5i— .3 
HE...................-] §6.8] §8|6.61] 1.2} .18] 58.6) 60/668} 1.4 
HH...................] 68.1] 67] 7.01/— 1.1]/— .16} 66.6] 66|6.96/— 0.6/— 
i eeerree rs 129 128 
, | SS eee 245.2] 245 244.4] 245 

Total 
Sitting 
Cale. Obs. ¢ D Die 

Sere rer ere 0.5 I 2.21 0.5 23 
LE.. 5.9 4 2.41 —1.9 —.79 
aaa 6.9 x 2.60 I.1 42 
Total L....... 13 
re ee ee 5-9 7 2.41 1.1 .46 
ven taken aeen aed 99.0 99 8.89 0.0 .00 
EH.. 115.2 114 9.39 —1.2 —.13 
Total E 220 
HL. 6.9 6 2.60 —0.9 —.35 
HE... 115.2 118 9.39 2.8 230 
SN sain hibia abs 0ie wd ales 134.8 133 9.88 —1.8 —.18 
es 257 
WOES By lg Bs 6 one cece 490.3 490 
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TABLE IX 


CoMPARISON OF OBSERVED AND CALCULATED FREQUENCIES OF 
Various SEQUENCE-PAIRS 


Subject: Sn 


















































I II 
Sitting 

Calc.| Obs.| ¢ D D/e | Calc.| Obs.| o« D Die 
LL... 2. ee ee ee ee ee | 20.6] 19] 4.35|— 1.6)— .37] 22.0] 151 4.47]/— 7.0|—1.57 
Feo 3-48|— 4-7|—1-35 9.7| 71|3.05|— 2.7|— .89 
00.6046 s0%00000e0e0 08 BE ree S| 1.15) 41.4] $115.86) 9.6) 1.64 
, ii 71 73 
EL 12.7] 13] 3-48) 0.3) .09) 9.7] I1| 3.05 3} «43 
eee ore 2.76|— 1.81—- .65) 4.1) 2)|2.01/— 2.1]—1.04 
DN ite hkeedenens een 23.5} 25/461] 1.5] .33| 18.2] 19]4.10) 0.8 20 
, SNe er 44 32 
HL | 37-5] 39] 5-64] 1-5] .27| 41-4] 46]5.86) 4.6) — .78 
ere eres 23.5] 3114.61) 7.5] 1.63) 18.2] 23/4.10) 4.8] 1.17 
.| 69.1] 60] 7.04/— 9.1/—1.29] 77.4] 681] 7.26|— 9.4]/—1.29 
0 errs 130 137 
WOON By By BB... 000s. 244.9] 245 242.1] 242 

Total 
Sitting 
Calc. Obs. o D Die 

RS ark ta Ghee ed ee epieg 42.9 34 6.25 — 8.9 —1.42 
ir cied eetaleeehaaeus 22.4 1S 4.62 — 7.4 —1.60 
errr ret 78.9 95 8.13 16.1 1.98 
Total L.. (RdeeeeaG 144 
eee sere ee 22.4 24 4.62 1.6 35 
Re 6'5 thiamin nade 11.7 8 3.38 — 3.7 — 1.09 
ik a ie edistisinidb abe tedeail 41.4 44 6.15 2.6 42 
ree 76 
schema ion ainda 78.9 85 8.13 6.1 75 
ere oe 41.4 54 6.15 12.6 2.05 
ERS 146.1 128 10.11 —18.1 —1.79 
Nd i dk nia selk 267 
, ) . ) 486.1 487 
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TABLE X 
D/o ror Totats For Eacu § 











S H 2 F A Oy T Sn 
| 53 — .98 —1.32 —1.55 —3.21 .23 — 1.42 
LE......} —2.29 — .O1 85 .29 —2.77 —.79 — 1.60 
ae 1.57 .86 59 .78 3.35 .42 1.98 
EL.. — .46 2.12 54 .46 1.22 .46 35 
EE.. 2.11 —2.71 — 1.90 — 1.00 — 2.02 .0O — 1.09 
EH... — 1.49 82 1.00 62 .gO —.13 .42 
a .22 —1.15 .20 .46 .69 —.35 75 
HE......| — .18 2.68 1.00 1.12 3.17 .30 2.05 
HH......| — .02 —1.55 — 1.02 —1.50 — 2.87 —.18 —1.79 


























closed by these results. The only pairs of judgments showing 
consistency of direction are the LH’s which are positive, and 
the HH’s which are negative, for all Ss. In addition there 
seems to be a tendency in 5 Ss (C, F, A, Oy, Sn) toward 
avoiding the making of two consecutive judgments in the 
same category. For these Ss the values for LL, EE, and HH 
are not only negative in all cases but in general are fairly large. 
The consistency of direction of these values, we believe, gives 
additional evidence for their reliability beyond that indicated 
in the ratios themselves. On the basis of our data, which 
may be regarded chiefly as illustrative of the possibilities of 
our technique, we are unable to generalize further. It re- 
mains for future experiment to determine the true extent 
both of individual differences and of common trends. 


4. Overestimation 


The results indicate the operation of another factor, that 
of overestimation of the second weight. Referring to Tables I 
and II, we find that in the case of each S the total frequency of 
‘heaviers’ exceeds that of ‘lighters.’ Except for F, this differ- 
ence is large for all Ss. This is much as would be expected 
in view of the well-known fact of overestimation of the second 
stimulus in psychophysical procedures. The problem arises 
as to the relationship between this overestimation and what 
generally is called time error. This problem is somewhat 
beyond the scope of the present paper, and requires special 
investigation. This is now in progress. 
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SUMMARY AND CONCLUSIONS 


Psychophysical judgments on objectively equal weights 
were obtained from seven Ss. On the whole, ‘heavier’ 
judgments showed the least variability and ‘lighter’ judg- 
ments the most. An analysis of the results revealed no ap- 
parent effects attributable to ‘degree of sophistication’ of the 
Ss, practice, or fatigue. The effect of previous comparisons, 
however, seemed quite definite in many cases. Five Ss, for 
example, showed a tendency toward avoidance of the repeti- 
tion of judgments in the same category. However, the neces- 
sity for further research was indicated. All Ss showed evi- 
dence of overestimation of the second weight. 

Our technique is offered as one particularly well adapted 
to the study of certain factors involved in the making of psy- 
chophysical judgments, such as the effect of previous com- 
parisons, which are difficult of analysis by the usual methods. 
It is suggested that the method may prove fruitful in attack- 
ing factors which have not been discussed in this paper, such 
as attitude, the effect of instructions, and temperamental 
type, as well as the investigation of other variables which can 
be attacked best by means of simplified experimental condi- 
tions such as are embodied in this technique, rather than by 
the more complex settings of the customary psychophysical 
procedures. 


(Manuscript received October 8, 1931) 











INFLUENCE OF AN INTERPOLATED ELECTRIC 
SHOCK UPON RECALL 


BY M. M. WHITE 
University of Kentucky 


The effect of interpolated emotional stimuli upon recall 
has been studied by Robinson, Skaggs, Harden, Stone, and 
others. All agree that the interpolation affects recall; but 
the nature of the effect is not clear. Harden (5) found that 
an electrical stimulus aided the recall of eleven subjects, 
hindered it in twenty-five, and had no effect on eight. What 
was the cause of the varying effect? <A possibility is that it is 
due to varying degrees of the intensity of the emotional 
disturbance. 


One measure of the intensity of emotional disturbance that 
has been used in the past is the galvanic reflex. To be sure 
there is no agreement among psychologists as to its validity. 
There is agreement, however, that the galvanic reflex occurs 
during emotional disturbances. The present writer sought by 
the use of the galvanometer to discover whether the effect 
upon recall of the interpolated emotional situation is related 
to the amount of disturbance as measured by the galvanom- 


eter. 
PROCEDURE 


Four lists of fifteen nouns of three letters each were used. Each list was presented 
one time to each subject on the Ranschburg memory apparatus. Each word was 
exposed to view for two seconds with no appreciable intervening interval. After an 
exposure the subject was either given an electrical stimulus and then read a book 
for twelve minutes or read immediately for twelve minutes. As the shock delayed 
the reading no more than ten seconds no substitute activity other than reading was 
provided. 

The subject was requested not to think about the words during the time he read 
the book. After the twelve minute interval he was given pencil and paper and asked 
to write down all the words he remembered in the list. When the subject had finished 
he indicated the fact and the time he had spent in recall was recorded. The subjects 
were allowed to work as long as they wished, and were not aware that they were 
being timed. 

After he had written all the words he could, the subject was told that he would 
not be tested again on that particular list, and instructed to forget the words as much 
as possible. He then read for five minutes. 
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At the end of five minutes a new list was presented and the above procedure was 
repeated until each subject had seen four lists after two of which he was shocked. 
The order of presentation of the lists and the position of the shock varied from subject 
to subject. For example, if one subject was given the lists in the order of DC AB 
and shocked after D and after A the next subject might be given the lists in the order 
of ABCD and shocked after B and C. The lists were given to every subject in a 
different order. By this procedure we attempted to control such factors as practice, 
the relative difficulty of the lists, fatigue and any possible effect of the shock due to 
position in the series. Those lists after which the shock was given are referred to 
as the experimental lists; the others are the control lists. In treating the results of 
any given subject the experimental lists are combined and the control lists are combined. 
A few subjects were given only two lists, receiving a shock after one of them. 

The galvanometer was connected as follows: The electrodes were made of zinc 
and were about three centimeters in diameter and two millimeters in thickness. These 
electrodes were covered with a paste made of kaolin, an extremely weak solution of 
zinc sulphate, and water. The hook-up used was one suggested by Darrow (2, 418- 
420). Briefly it is as follows: The leads from the electrodes were connected in series 
with a ten thousand calibrated ohm resistance and in turn to the X arm of the wheat- 
stone bridge. A shunt was inserted across the galvanometer poles. A one hundred 
thousand ohm resistance was so hooked up that it could be substituted for the subject; 
that is, a two-way switch was inserted so that either the subject and the ten thousand 
calibrated resistance were in the X arm of the wheatstone bridge, or the one hundred 
thousand ohm resistance and the calibrated resistance of ten thousand ohms were in 
the X arm of the wheatstone bridge. The deflections of the galvanic mirror were 
traced on a moving piece of paper. The paper was sixteen centimeters wide and was 
ruled lengthwise—a line occurring every two centimeters. 

The procedure for measuring the galvanic resistance was as follows: The electrodes 
were attached; the wheatstone bridge was balanced with the resistance in series with 
the subject at five thousand ohms. The subject was allowed to rest ten minutes. 
The wheatstone bridge was again balanced. ‘The one hundred thousand ohm resistance 
was substituted for the subject and adjusted until it balanced the wheatstone bridge. 
The shunt of the galvanometer was adjusted until a two centimeter deflection was 
caused by a change of five thousand ohms. Then the subject was substituted for the 
one hundred thousand ohm resistance. ‘The words were presented, and the galvanic 
deflections were followed on the moving paper. In the experimental series the shock 
was given immediately after the list of words was presented. The instant it was 
given was marked on the moving paper and the galvanic deflections recorded for ten 
seconds after the shock. The one hundred thousand ohm resistance was substituted 
for the subject once more and changed until the wheatstone bridge was balanced. 
The calibrated resistance was then changed until the maximum deflection of the 
galvanic mirror as traced on the paper was reproduced. This measurement represented 
the amount the subject’s resistance changed after the shock. In order to make 
subjects comparable the ratio of this change to the subject’s resistance just before 
the words were presented was computed and stated in terms of percent. 

Thirty subjects were used; twenty-two men and eight women. The subjects 
were students in the elementary psychology classes. 


RESULTS 


From Table I we see that: (1) thirteen subjects recall 
more words in the experimental series than in the control 
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TABLE I 


VARIATIONS IN NUMBER OF Worps RECALLED, TIME, AND GALVANIC RESISTANCE 








All _S’s who S’s who S's who did 
subjects improved | did worse the same 





Ex- | Con-| Ex- | Con-} Ex- | Con-| Ex- | Con- 
per. | trol | per. | trol | per. | trol | per. | trol 























as eb dgne ews eae 30 130 (13 «|130 «JIS [IS 2 2 
Av. number of words recalled......... 6.93] 6.33] 9-54] 5-23] 4-73] 7-27] 6.5 | 6.5 
Av. time per word recalled (seconds). . .|21.59]15.98]13.97/19.16/22.70|20.94| 18.85 10.38 
Av. percent change G. R............. 41.16 56.90 25.26 37.00 











series; fifteen recall fewer words in the experimental than in 
the control series; and two subjects recall the same number of 
words in the two series. (2) The average gain shown by the 
subjects who improve is greater than the loss shown by the 
subjects who are hindered by the shock. This is true whether 
we compare the experimental series with the control series 
within the same group or with the total number of subjects. 
(3) The average time per word recalled is reduced after shock 
for the subjects who gain in recall but is greater for subjects 
who lose or remain the same. (4) The average change in 
electrical resistance in terms of percent is twice as great for 
the subjects who gain in recall after shock as for the subjects 
who lose (56.90 as compared to 25.26). 

The relation between change in resistance and recall after 
shock is shown in Chart I. Thirty-six percent may be looked 
upon as a rough critical score. Of the sixteen subjects whose 
change was less than thirty-six percent, twelve showed a loss 


Cuart I 
ELECTRICAL RESISTANCE AND RECALL 


Loss in words o Gain in words 














100% 
Change of o S’s 7 S’s 
galvanic 62% 
resistance 
3 S’s 3 S’s 
37% 
12 S’s 3 S’s 
0% 
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in recall, three showed a gain, and one remained constant. 
Of the thirteen subjects whose change was greater than thirty- 
six percent, ten showed a gain in recall after shock, three 
showed a loss and one remained constant. The seven sub- 
jects whose resistance changes were greatest also showed the 
most improvement in recall after the shock; that is, the seven 
subjects whose resistance changed as much as sixty-two per- 
cent did better during the experimental series than in the 
control series. 

The explanation of the varying effect of the electrical 
stimulus may lie in the following factors: 1. The factors of 
retroactive inhibition, 2. Changes in tonicity, 3. Absence or 
presence of an emotion, 4. Unitary experience of the electrical 
stimulus and the words to be learned. 

In the case of the fifteen subjects who were hindered by 
the electrical stimulus, the explanation must lie in those fac- 
tors which explain retroactive inhibition, such as breaking 
of set, as indicated by Mueller and Pilzecker. These factors 
of course operated also in the case of the thirteen subjects who 
improved as the result of the electrical stimulus. In these 
subjects it is to be remembered the electrical stimulus elicited 
the greatest galvanic change. ‘They probably had the great- 
est increase in tonicity as the result of the electrical stimulus. 
The evidence for the higher tonicity is of two kinds: first, 
lowered resistance accompanies or is the result of increased 
muscular tension (8, 9); second, the speed of work was greater 
in these subjects. The electrical stimulus and subsequent 
higher tonicity occurred so closely after the presentation of 
the series of words that they were experienced together. 

The above explanation is in accord with the work of Bills 
(1) who found that more efficient work was done by subjects 
when they were tense than when they were relaxed. If our 
supposition is correct Bills would have had similar findings if 
his subjects had increased their tenseness immediately after 
the presentation of the words, though the effect would not 
have been so pronounced. 

It has been suggested by some experimenters (6) that 
whereas large changes in electrical resistance indicate the 








756 M. M. WHITE 


presence of emotion small changes do not. Darrow (3) sug- 
gests that emotions may have two aspects, one ‘‘the imme- 
diate reflex response to ‘sensory’ excitement” and the other 
‘“‘the response mediated by associative processes or ideas 
aroused by the stimulus.” It is conceivable that if only the 
‘immediate reflex response’ occurs there is only a small gal- 
vanic change and that there is no emotion. In our subjects 
those whose galvanic change was less than thirty-six percent 
experienced no emotion; those whose galvanic change was 
relatively great did experience an emotion. If this view is 
correct, our results are in accord with the suggestion given by 
Dashiell (4) that mild emotions are helpful though extreme 
emotions are a hindrance to efficiency. 

In the case of the two subjects who were neither helped nor 
hindered by the electrical stimulus, there must have been a 
balancing of facilitating factors and retroactive inhibitory 
factors. 

SUMMARY 

Electrical stimuli were interpolated between learning and 
recall. The galvanic deflection was recorded. It was found 
that: 

1. Electrical stimuli occurring within ten seconds after the 
presentation of a list of words to be learned facilitated recall 
in thirteen out of thirty subjects; hindered recall in fifteen 
subjects; and had no effect upon two subjects. 

2. Relatively small changes in galvanic resistance accom- 
panied a loss in recall and relatively large changes in galvanic 
resistance accompanied a gain in recall. 

3. The average time per word for recall after shock was 
less for subjects who showed a gain in the number of words 
than in any other series. 

An explanation in terms of increased tonicity is suggested. 


(Manuscript received July 25, 1931) 
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OCULAR DOMINANCE IN YOUNG 
CHILDREN 


BY RUTH UPDEGRAFF 


Towa Child Welfare Research Station 
State University of Iowa 


Iowa City, Iowa 


That most individuals are inconvenienced to such a slight 
degree by normal diplopia is not surprising. The fact that 
our attention is primarily concerned with the object upon 
which our eyes are focused is an important factor in making 
us disregard the double images of other objects in front of that 
object and behind it. Moreover, when forced to consider two 
objects in relation to each other we are able to disregard the 
image of one eye and attend only to the image of the other. 
One eye dominates the other, so to speak. Of this many of 
us are quite unaware. 

Although this ignoring of one image in order that the other 
may become clarified has been recognized for some centuries, 
it is only in recent years that ocular dominance has been 
studied with any consistency or exactness.) Interest in ocular 
dominance has increased with knowledge of dominance of other 
types, particularly hand dominance. The possibility of the 
existence of relationships between various types of dominance 
has encouraged analytic studies in this field. 

It is true that under certain conditions an individual is 
forced to use one eye or the other. This applies not only to 
such circumstances as those in which sighting is involved, but 
also to others in which both eyes are in use and directed at the 
same object. The individual is not conscious that the image 
of one eye dominates that of the other, for to the best of his 
knowledge he is seeing with both eyes. It has been fairly 
well established in the case of adults that one of the eyes is 
consistently dominant in those situations in which there must 


1Dr. Walter Miles has contributed two discussions of the history of the study of 
this subject (2, 3). 
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be a choice between the two images. Just why this is true is 
still unanswered. The present study was undertaken with the 
hope of determining how early in life this dominance may be 
found. 

Nearly all of the methods used in previous studies of 
ocular dominance are unsuitable for experimental work with 
young children inasmuch as they necessitate verbal responses 
based upon judgments, or subjective accounts of the visual 
field. It seemed, however, that the underlying principle of 
the use of the V-scope designed by Dr. Miles (2) was feasible. 
This simple apparatus is easily manipulated by the observer. 
It consists of a truncated, flattened cone, 12 inches long, large 
enough at one end to enclose the forehead and both eyes, and 
small enough at the other so that the visual field 6 feet 
away is limited to a diameter of some 12 inches. Looking 
through this cone at the object 6 feet distant the observer can- 
not see it with both eyes because the opening of the cone is too 
small. He must so hold the cone that one eye or the other is 
used. He is quite unconscious of this, however, and since 
he is asked to make some judgment concerning the distant 
object, his attention is directed at it and not at the V-scope. 
To the observer the V-scope is a means, not anend. On the 
other hand the examiner, who is holding the object, can readily 
see which eye the subject is using by noting which eye shows 
through the small end of the cone. The dominant eye is thus 
determined independently of verbal report. There is, also, 
less chance for one hand or the other to influence the result 
since both hands are needed to hold the V-scope. 


METHOD OF EXPERIMENT 


Subjects 


The children participating in this study were all in attend- 
ance in the preschool laboratories of the Iowa Child Welfare 
Research Station. One hundred ninety children from two to 
six years of age were tested. One hundred thirty-three of 
these were tested twice, thirty-eight three times, eleven four 
times, and one five times; twenty-three were given additional 
retests in the first and second grades. 
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Apparatus 


At the time this study was begun, an attempt was made 
to use the linen V-scopes originated by Miles (2). It was 
found that although they were fairly satisfactory for the 
testing of five-year-old children, they were not fulfilling their 
purpose with the younger children. The collapsibility of the 
original V-scope, an advantage in the testing of older subjects, 
is a real problem for the young child. He finds the V-scope so 
difficult to keep open that he cannot look through it as he 
wishes. To overcome this disadvantage metal tubes were 
made. Sheet-zinc cones, identical to the V-scope in dimen- 
sions and pattern, proved easy for the children tohold. Edges 
of the metal were turned back and pressed down so that all 
surfaces were smooth. The metal cones had an additional 
advantage in that they could be disinfected easily. 

The objects to be viewed were the ten small pictures form- 
ing part of the Miles test. These are outline sketches on 
2 X 3 inch cards of such figures as a woman’s head, a cat, a 
bicycle. Although all of these drawings are not particularly 
suited to the interests of young children, some of them are 
sufficiently simple and familiar to be comprehensible even to 
the youngest ones; others need careful scrutiny by the oldest. 
There is, therefore, a quite satisfactory mixture of easy ma- 
terial to give encouragement, and difficult material to insure 
careful inspection by the child. It will be remembered that 
for test purposes these pictures are important only instru- 
mentally and the correct naming of them by the children has 
no significance. 

Procedure? 


The child is brought into the testing room where there are 
two small chairs facing each other, 6 feet apart. On a table 
nearby are three V-scopes. The examiner picks up one of 
these, saying, “‘Do you see this? It’s a long thing for you to 
look through. Wecall ita V-scope. See—you hold it up like 


? For the collection of approximately one-fourth of the data reported in this 
study the author is indebted to her class in experimental psychology of childhood, 
1929-1930, and to Miss Lois Jack, research assistant in the lowa Child Welfare Re- 
search Station. 
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’ 


this.” (The examiner demonstrates, sitting down as he does 
so.) ‘“‘Youtryit. Now I’m going to show you some pictures 
and you can see what they look like through the V-scope 
Suppose you sit in this chair.”” The examiner sees that the 
child is comfortably seated in the chair directly facing him. 
[“‘Now you hold this in your hands” (placing the V-scope in 
his hands in the correct manner) “‘and see if you can find the 
picture [ shall hold in front of my nose.”” (The examiner holds 
the picture as indicated and waits for the child to find it and 
look at it.) ‘‘Good. Now I’m going to give you another 
V-scope.”] The examiner changes the V-scope. ‘‘Now, I’m 
going to sit over here and show you some pictures. You tell 
me what they are.” 

The examiner shows ten pictures, recording whether the 
child is looking with the left or the right eye. The V-scope 
is changed between each picture so that a new adjustment is 
assured each time. Before a picture is shown the examiner 
repeats the part of the instructions included between the 
brackets. 

During the first two years this test was given, all the chil- 
dren were shown ten pictures. Since it was found that if a 
child shifted in the use of eyes he did so early in the series, in 
the third year of the experiment the plan of presentation was 
slightly changed; only the first five pictures were shown to 
those children who responded similarly in all five. If there 
was a shift in use of the eyes all ten pictures were shown. 

If a child used the same eye eight, nine, or ten times in the 
ten chances he had, he was designated as using that eye 
dominantly. Those children who did not respond so de- 
cidedly but who used one eye nearly as frequently as the other, 
that is, in five, six, or seven times out of ten, were said to be 
indefinite in eye dominance. 


RESULTS 


The experiment proved to be an interesting one to the 
children. The metal tubes were easy to handle, there seemed 
to be an added motivation in the fact that the V-scopes had to 
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be changed for each picture, and the task of identifying the 
pictures seemed to be challenging. 


intervals of not more than two months. 


Reliability 
In order to determine whether the results of one test might 
be considered representative, repeated tests were given at 


lation presents the data thus secured. 





The following tabu- 

















Same Eye Dominant 
Age, Children Different Eye 
Years Dominant 
Left Right Both 

2 13 2 Oo 2 9 

3 19 6 9 I 3 

4 27 8 15 I 3 

5 45 10 27 3 5 
Total 104 26 51 7 20 




















The percentage of agreement between first and second 
tests of three-, four-, and five-year-old children is high. 
Dominance was the same in the case of 88 percent of these 
children. Apparently the test was not reliable for two-year- 
old children since so large a proportion of their number re- 
acted differently in the two tests. This may mean that the 
test is not a satisfactory one for such young children or that it 
is suitable but dominance has not been established. In addi- 
tion, the number of cases is small. In consideration of the 
above facts the remainder of the study is limited to the results 
of tests of three-, four-, and five-year-old children. 

The reliability of the test, when used with the three older 
ages, compares well with that found by Miles (2) for first and 
third grade children (namely, from 75 to 95 percent), and for 
children and adults (3) (approximately 80 percent). 


Frequency of Dominance Types 


The following tabulation showing the relative frequency 
of occurrence of dominance types in three-, four-, and five- 
year-old children does not include data from tests found un- 
reliable: 
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Percent 
Fal Children 
Left Eye Right Eye Indefinite 
3 39 20.5 64.1 15.3 
4 71 22.5 66.1 11.2 
5 88 35.2 57-9 6. 

















Evidently the relative frequency of right- and left-eyedness 
as shown by this test does not vary significantly with age at 
these early years. Right-eyedness occurs some two and a 
half times as often as left-eyedness. Here, as well as in relia- 
bility, there is agreement with the results of other investigators 
observing older subjects. Parson (4) using his manuscope, 
found 69 percent of 877 public school children right-eyed, 29 
percent left-eyed. By a similar method, Cuff (1) obtained 
with a group of 146 school children percentages of 62 and 29 
respectively. Miles’ (2, 3) right-eyed subjects represented 
approximately 64 percent of his group, while he found approx- 
imately 34 percent left-eyed. 

There seems to be some indication that indefinite eyedness 
or the tendency to use one eye at some times, the other eye 
at other times, decreases with age in the early years. This is 
indicated in the present study and the conclusion is strength- 
ened by data of others. This study shows fewer instances of 
indefinite or impartial eyedness at five years than at three. 
In older subjects Parson found indefiniteness in 1 percent of 
his cases, Cuff in 9 percent, Miles in 2 percent. 

From the above results concerning reliability and fre- 
quency of eye dominance, it seems justifiable to believe that 
for many children ocular dominance, as indicated by this test, 
is not established before the age of three, and that some chil- 
dren do not show dominance for several years afterwards. It 
is not known whether dominance is ever established in the 
case of some of these individuals. Pertinent results in this 
study are discussed later. 


Sex Differences 


Table 1 indicates that there are no sex differences greater 


than the possibility of error. 
50 
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TABLE 1 


Ocutar DomInaANcE IN THREE-, Four-, AND FIVE-YEAR-OLD CHILDREN 























Percent 
pee. Children 
ears 
Left Eye Right Eye Indefinite 

Boys 
3 26 23.0 57.6 19.2 
4 35 17.1 71.4 11.4 
5 42 35-7 59.5 4.8 

Girls 
3 13 15.3 76.9 7-7 
4 36 27.8 61.1 11.1 
5 46 34.8 56.5 8.6 





Consistency of Eyedness Over a Period of Time 


In order to throw more light on the problem of stability of 
eyedness, sixty-three children were given several successive 
tests, the total intervening time varying from six months to 
two years. The following tabulation shows the results: 

















: a: Same Eye Dominant = 
nterval Between : ominance 
Tests Tests Children Shifted 
Right Left Indefinite 

3 RN ic whe aews 13 6 I fe) 

2 0 Parte 39 18 8 2 II 

2 6 months....... II 4 4 Oo 3 
Total 63 28 13 2 20 























Additional information was gained from tests given to 
twenty-three six- and seven-year-old children who had been 
last tested at the age of five. Of the twenty-three, seventeen 
used the same eye dominantly as in the previous tests. Inthe 
whole group there were fifteen right-eyed children, seven left- 
eyed, and one apparently able to use either. The same chil- 
dren in the last preceding test were classified as follows: 12 
right-eyed, 10 left-eyed, and one who used either eye. A 
larger percent of the children originally left-eyed had shifted 
than was the case with those originally right-eyed. 

Earlier in this paper the question was raised concerning the 
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eventual establishment of dominance in children who were 
impartial-eyed at the age of three. Of the sixty-three children 
in the above table eight were impartial-eyed at the first test. 
At the time of the last test, two of these still remained in that 
classification, while six had become either right- or left-eyed. 


Relation to Handedness 


Previous investigators who have studied the relation be- 
tween hand and eye dominance have found that right-eyed 
persons are more apt to be right-handed than are the left-eyed 
to be left-handed. The incidence of left-eyedness is greater 
than that of left-handedness. For several reasons it is not 
possible to be definite about these dominance relationships in 
young children. There are as yet no proved methods of 
determining handedness nor exact knowledge of its nature. 
Handedness tests of young children have included anatomical 
measurements, tests of strength, skill, and customary usage. 
Investigators have been prone to choose a single criterion, or, 
in the face of varying results with several criteria, have been 
in doubt concerning correct interpretation. 

After giving a battery of tests of handedness to seventy- 
five of the children used in this study, it is possible to state that 
such handedness tests as are available indicate results similar 
to those found in the case of adults. However, the reliability 
of the tests in the battery varied; some were not validated by 
observational data. At the present time a more comprehen- 
sive study of handedness is in progress and it is hoped that 
in the near future valid and reliable data upon which to base 
correlative studies will be available. 

In the course of this study two cases of interest in relation 
to this matter were noted. Between the ages of four and 
seven two children shifted in hand preference—one from right 
to left, the other from left to right. This information is based 
on teacher reports. At the time their ocular dominance was 
first determined each was identically dominant in eye and 
hand. Inthe following year, the year of transition in handed- 
ness, the test showed no eye dominance present. The next 
test indicated that the ocular dominance had shifted with the 
hand. One of these children was the subject of his parents’ 
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attempts to change his handedness. The circumstances of 
the other instance are not known. 


SUMMARY 
1. A modification of the Miles A-B-C Vision Test was 


used in 396 tests of children two to six years of age. 

2. The test proved to be reliable for children from three to 
six years of age. 

3. This method did not reliably test the two-year-old 
children to whom the test was given. This may mean that 
the test is not a satisfactory one for such young children, or 
that it is suitable but dominance has not been established. 

4. The relative frequency of right-eyedness and left-eyed- 
ness in children from three to six years old is comparable to 
the frequency of their occurrence in older children and adults, 
as found by previous investigators. Approximately 60 per- 
cent of the children were right-eyed and from 20 to 30 percent 
left-eyed. 

5. Results indicated that impartial-eyedness is more com- 
mon in young children than in older. It occurs more fre- 
quently in three- than in five-year-olds. From a comparison 
of these results with those of others studying older children, 
indications are that impartial-eyedness is more common in 
preschool children than in children of school age. 

6. There were no apparent sex differences. 

7. It seems justifiable to believe that for many children 
ocular dominance as indicated by this test is not established 
before the age of three. Thereafter, however, it is commonly 
found. 

8. Consistency in ocular dominance over a period of years 
was apparent in approximately 75 percent of the children of 
whom such testing was possible. 


(Manuscript received May 19, 1932) 
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THE AUTONOMIC NERVOUS SYSTEM AS A FACTOR 
IN THE PSYCHOGALVANIC REFLEX 


BY W. D. O’LEARY, S.J., M.D. 
St. Louts University 


The part played by the autonomic nervous system in the 
production of that somewhat enigmatic phenomenon known 
as the psychogalvanic reflex has been frequently emphasized 
by many authors. The various theories held at the present 
time concerning this and other aspects of this reflex may be 
found summarized in a very satisfactory manner by Landis 
and DeWick.! 

The object of this paper is not to add another theory to 
those already existing but to present some rather interesting 
laboratory findings. Experimenting on ourselves and stu- 
dents in the psychological laboratory of St. Louis University 
we focused our attention on factors which might influence the 
autonomic, eliminating as far as possible all emotional stimuli 
and muscular movements. The factors selected as most likely 
to produce appreciable effects on the autonomic were very 
deep breathing, shallow and rapid breathing, the holding of 
the breath and increased heart action. These have been used 
by other investigators, notably Radecki.?, With this author, 
Vigouroux and Piéron, we also felt that vasodilation should 
give considerable information. 

The results of these experiments were far from satisfactory 
for although we obtained changes in resistance, and often 
very marked changes, there seemed to be no way of assuring 
ourselves that we were measuring only autonomic changes 
to the exclusion of others such as the hydrogen ion concentra- 
tion of the blood. 

The only significant thing shown by these experiments on 


1 Landis, C., and DeWick, H. N., The electrical phenomena of the skin, Psychol. 
Bull., 1929, 26, 64-115. 

2 Radecki, W., Recherches experimentales sur les phenomenes psychoelectriques, 
Arch. de Psychol., 1911, 11, 209-293. 
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human subjects was that there was a constant and gradual 
drop in the resistance which seemed to be hastened by the 
breathing etc. But when a certain ‘base level’ (which varied 
in each subject) was reached the factors which had previously 
affected the resistance no longer had any influence. This 
may have been a part of the phenomenon of ‘continued 
lowering of resistance’ as described by Thouless.’ 

This continually decreasing resistance seemed to be mark- 
edly accelerated if a tourniquet were applied to the arm in such 
a way as to cause considerable distention of the blood vessels 
of the hand to which the electrodes were attached. For this 
purpose we used a sphygmometer which caused no disturb- 
ance from traction or muscular movement. With the blood 
vessels thus distended deep breathing produced very wide 
and sudden swings in the galvanometer needle. In this in- 
stance also we were unable to reproduce this phenomenon 
after the resistance had reached what we choose to call a 
‘base level.’ It is possible that polarization played some part 
in this (we were using nickel electrodes at the time) for we 
were unable to elicit similar drops with a non-polarizing elec- 
trode. 

It became evident that the best way to eliminate question- 
able factors and to get down to purely autonomic influence 
was to make use of experimental animals where the autonomic 
could be altered to suit our purposes. 

Dr. Kuntz of the Neural Anatomy Department very kindly 
gave us the use of several cats whose inferior cervical ganglion 
had been removed on one side only. This ganglion had been 
so removed that no sympathetic fibers whatever remained 
connected with one of the front paws while the fibers to the 
other front paw were not disturbed. Such an arrangement 
gave us an excellent means for comparing the resistance under 
identical conditions in all save the autonomic supply. Even 
the question of polarization and other physical factors which 
might enter could not destroy or interfere with the results 
which would be taken from a comparative point of view. 

The hairs on both front paws were completely removed 


3 Thouless, R. H., Causes of continued change in resistance, Brit. J. Psychol., 
1927, 17, 5-14. 
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with sodium sulphide and electrodes which had been made 
especially for this purpose were securely fastened to the flexor 
and extensor surfaces of each paw. ‘These electrodes consisted 
of small copper plates to which copper wires had been soldered. 
They were made small enough to include only the ‘cushion’ 
part of the paw since this part alone contains sweat glands. 
They were kept moistened with gauze saturated with normal 
saline solution. 

In the first series of experiments the cats were not given 
ether but after being placed comfortably in the arms of an 
assistant and gently stroked they became quickly resigned to 
the unusual apparatus and, although seldom absolutely free 
from some excitement, made few attempts to get rid of this 
incumbrance. 

Readings were then obtained of the resistance of each 
paw. A Leeds Northrop resistance box with multiple resist- 
ance coils and a sensitive galvanometer were used to measure 
this resistance. A three-way switch was used at first but soon 
discarded on account of the fact that too many new points 
of resistance were thereby introduced. Direct connections 
were used in all the recorded experiments. 

After taking readings every two or three minutes for 30 
minutes or so it was found that the resistance fluctuated con- 
siderably on the sound side while the resistance of the oper- 
ated paw remained absolutely the same. The cat was of 
course continually becoming restless and had to be quieted 
but whether quiet or restless the operated paw showed not the 
slightest change in resistance. 

Marked emotional states were then produced by pinching 
and twisting the tail of the cat. In these states of extreme 
excitement the sound paw went through rapid and wide 
fluctuations in resistance but there was still no change what- 
ever in the operated paw. The paws were firmly held so that 
the changes were not due to muscular movements. If this 
had been the case there is no reason why the operated paw 
should not have produced the same or at least some deflections 
of the needle. The same experiment was repeated on other 
cats similarly operated, that is with the inferior cervical gang- 
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lion completely removed on one side. In every case the sym- 
pathectomized side showed no change at all in resistance. 


TABLE NO. 1 


Unoperated Paw Operated Paw 
(corresponding readings) 
5,400 Ghee (encited). .. 0... ccc cccccccccccscccsces POOCRMS 
i TI CINE. oon os kia cccccecccscescsncssss FUDGE 
goo ohms, slightly excited.........................+ 700 ohms 
ON ee 
I II, onc cs cc cessasencvasneccescecsces SODGR 
2,100 ohms, markedly excited. ...............++...++. 700 ohms 
(Period of rest 

EET EEE TT er ee eT 700 ohms 
ee 
I, CIE CII a ono bis cc ccesndccesoneves 700 ohms 


Note—Only a few minutes elapsed between readings. 


In the next series of experiments these cats were anesthe- 
tized and after getting readings from both paws, I cc. of a 
I—10,000 solution of Adrenalin Chloride was given subcu- 
taneously to the first cat. Readings were taken every 4 or 
5 minutes for 45 minutes. ‘There was practically no change 
in the resistance of either paw. If the adrenalin subcutane- 
ously were effective it was expected that changes in the resist- 
ance of the intact side would be produced while the operated 
side would remain the same, just as in the cases of emotional 
excitation. This was not the case however for there were 
slight variations in the resistance of both paws, the operated 
as well as the unoperated. The readings from the unoperated 
side ranged from 350 ohms to 440 ohms, while the range on 
the operated side was from 410 to 480 ohms. 

It may be that the adrenalin when given subcutaneously 
had some effect on the muscles directly, but it seems more 
likely that these slight changes were more or less accidental, 
for on removing the electrodes after the experiment it was 
found that on both sides they had become somewhat loosened, 
consequently the contacts were not good. This had not 
happened in any of the other experiments. This explanation 
seems all the more likely in view of subsequent experiments 
where subcutaneous adrenalin produced no change on the 
operated side while the sound side showed considerable 
fluctuation. 
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The adrenalin was then given intravenously to another 
cat with very good results. One cc. of a I-10,000 solution was 
again used. Here again the readings from the sympath- 
ectomized side showed no fluctuations whatever while the 
other side changed considerably. Within five minutes after 
the injection of the adrenalin intravenously the resistance on 
the sound side went rapidly from 620 ohms to a maximum of 
980 ohms, after which it began to drop slowly until a half 
hour after the injection of the adrenalin the resistance on this 
side was 8700hms. During all this time the readings from the 
sympathectomized paw remained absolutely the same, that is 
the resistance was 470 ohms before the ether was given, 470 
ohms after ether but before adrenalin and 470 at every reading 
in the half hour after adrenalin. 


TABLE NO. 2 
Unoperated Paw Operated Paw 
400 ohms before ether...............+.. 470 ohms before ether 
620 ohms just before adrenalin. ......... 470 ohms after ether 
g80 ohms 5 min. after adrenalin........ 470o0hms 5 min. after adrenalin 
960 ohms 20 min. after adrenalin........ 470 ohms 20 min. after adrenalin 
870 ohms 25 min. after adrenalin........ 470 ohms 25 min. after adrenalin 


It might be well to note that the blood vessels in the sym- 
pathectomized paw were, as one would expect, markedly 
dilated and remained in this condition. As a consequence the 
blood volume of the operated side remained much more con- 
stant than the side still controlled by the sympathetic. It 
has been pointed out by Uhlenbruck‘ that blood volume 
changes produce resistance changes. It does not seem how- 
ever that constant blood volume alone accounts for the ex- 
traordinary immutability in resistance, for it is doubtful if the 
blood volume of the sympathectomized side remained so un- 
varying. Other factors which occurred during the experi- 
ment should have influenced the blood volume of both paws. 
For example, at times the paws were sharply elevated, at 
other times somewhat pendent; sometimes the assistant hold- 
ing the cat was exerting considerable pressure on the proximal 
part of the leg, while at other times it was perfectly free. In 


‘Uhlenbruck, P., Uber den Autonomen Reflex, Zeitschr. f. Biol., 1924, 81, 51-56. 
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spite of all these changes the readings from the operated paw 
never changed so much as an ohm. 

Gildmeister® thinks that the psychogalvanic reflex is 
merely a part of the general autonomic response. The results 
of our experiments seem to show that the autonomic nervous 
system plays a tremendously important part, if not the only 
part, in the psychogalvanic reflex. The remarkably constant 
readings from the sympathectomized paw could hardly be 
due to stability in the permeability of the cell membranes, 
which would otherwise be regulated by the autonomic as 
Gildmeister at one time postulated. The influence appears 
to be much more direct. ‘The same may be said with regard 
to the sweat glands for although they are regulated by the 
sympathetic still in the case of cats with the inferior cervical 
ganglion removed, entire absence of variation in resistance 
seems difficult to explain by the absence of sweat gland activ- 
ity alone. It seems difficult to explain for this reason, namely 
that the experiments were performed on cats at longer or 
shorter intervals after the sympathectomy. Use was made 
of one cat so soon after the sympathetic operation that he was 
still stuporous from the effects of amytol which had been 
given for that operation. At least in this last case it seems 
likely that there were some kind of changes going on in the 
sweat glands, sufficient to make a difference in resistance of a 
few ohms. 

The findings here presented have not been checked from 
as many angles as the author would wish, nevertheless the 
influence of the autonomic on the psychogalvanic reflex may be 
seen from the above cited experiments even without more 
profound and exhaustive investigation. 

Since the cats used were all from the experimental labora- 
tory of Dr. Albert Kuntz I feel that we can be reasonably 
certain that the condition of the autonomic was in every in- 
stance as stated in the beginning of this paper. 


5 Gildmeister, M., Uber die elektrischen Leitungseigenschaften der Saugerhaut, 
Pfhliiger’s Arch. f. d. g. Physiol., 1922, 194, 323-325. 


(Manuscript received August 6, 1931) 








THE REGULATED ALTERNATING CURRENT AS 
A TIME MEASURE 


BY THOS. H. HOWELLS 
University of Colerado 


Within the last two or three years a new and improved 
means of time measurement has come into common usage by 
the general public. The synchronous electric clock is the 
agency which supplies this nearly-perfect record of the flight 
of time. The chief virtue, however, does not lie in the clock. 
The latter consists merely of an electric motor of the well- 
known synchronous type and a suitable system of reducing 
gears. ‘The mechanism operates in unison with the generator 
at the power station just as if it were directly connected to it. 
The possibility of using such an electric motor as a clock lies 
in the fact that the generators in the power house are now 
operated in synchronism with a very precise chronometer, 
the accuracy of which is continually checked by astronomical 
observations. Thus it is that the average man can now have, 
for five dollars, a clock which is more accurate than the chro- 
nometer in the jewelry store. It seems obvious that such an 
accurate time indicator should find use in the psychological 
laboratory. 

In addition to the element of accuracy there is another 
advantage'of using the synchronous motor as a time reference 
in the laboratory, namely that it is much more rugged and 
powerful than the old-style clock motor and its accuracy is 
therefore not interfered with by any reasonable load that is 
put upon it. Any machine which is operated by a synchro- 
nous motor using regulated current moves with a perfectly 
constant speed and is therefore, in itself, an accurate time 
keeper. 

The simplest method of making use of the regulated cur- 
rent in the laboratory is to purchase a simple electric clock and 
adapt if for the purposes at hand. ‘There are several wheels 
or gears in the electric clock which run much faster than in the 
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ordinary clock and these can be arranged to make electric 
contact so as to operate a time-marker or sounder at almost 
any frequency desired. No especial precaution need be taken 
to prevent friction at the points of contact as the motor does 
not slow down under a load as do most spring-operated de- 
vices. The motor in an electric clock is small but it is com- 
paratively much more powerful than the spring motor of a 
clock. The writer has used a clock which has a starting wheel 
projecting at the back which rotates at a speed of one revolu- 
tion per second. Attachment to this wheel of a metal arm 
which makes contact with a mercury cup provides an excellent 
substitute for the seconds-pendulum which is ordinarily used 
for operating an electric sounder. Lesser intervals can be 
obtained by attaching additional arms to this wheel, or by 
making connection with some of the gears that are turning 
more rapidly. Longer time intervals can be obtained by 
connecting with the seconds-hand, minute-hand or hour-hand 
on the face of the clock. 

Larger types of synchronous motors can be used to operate 
special laboratory devices, such as a kymograph or tachisto- 
scope. Once the proper speed-changing mechanism is pro- 
vided for connecting the motor to the apparatus one can be 
quite certain that it will always move at a speed which is 
constant and accurate beyond any ordinary requirement. 

One of the questions that naturally arises in regard to any 
new means of time measurement is whether it may make pos- 
sible a simpler and more accurate type of chronoscope. The 
schema of such an instrument is shown in Fig. 1. T is the 
turn-table of a phonograph which is operated by a synchro- 
nous motor using the regulated current. The particular turn- 
table motor used in the construction of this apparatus is part 
of a Rotrola phonograph pick-up unit. It is intended to be 
used in connection with a radio amplifier and speaker for the 
purpose of reproducing phonograph records. It is made by 
the Allen-Hough Company of Milwaukee. Any other similar 
unit would serve equally well provided it includes a synchro- 
nous motor. On the turn-table is placed a disc of paper which 
has been cut to the pattern of a phonograph record. 
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The inductor unit, which is ordinarily at the end of the 
flexible arm, F, is replaced by a pencil as shown in the diagram. 
This pencil is supported slightly above the paper by a cord 
which runs over the pulley, P, and is attached to a metal ring 
which is hooked over a metal peg at N. A branch from this 
cord is attached to the response key, R. 

When the stimulus key, S, is depressed the cord is pushed 
off the peg which allows the pencil to drop on the rotating arm. 
It continues to draw a line on the paper until the pencil is 
lifted again by depression of the response key, R. The length 
of the line should be measured in terms of degrees of a circle 






































with a protractor. When the speed of rotation of the turn- 
table is eighty revolutions per minute the degrees of rotation 
per second is 480, which means that each degree represents 
2.08 sigma. 

Additional records can be made on the same paper simply 
by shifting the pulley, P, laterally on the axis, 4. 

Special paper records are being constructed which are 
engraved with radial lines which give the sigma values directly. 
The paper blanks which are used on the disc type of recording 
thermometer are also very easily adapted for use with this 
apparatus. 

It is possible, of course, to locate the keys in any position 
and at any distance from the instrument, provided that the 
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cord does not stretch a great deal under changes in tension. 
A means of electro-magnetically releasing and lifting the 
pencil-arm could be devised quite readily and might facilitate 
control and reduce mechanical errors to some extent. 

I’xperience with this instrument seems to indicate that it 
is more accurate, and quicker and simpler in action than any 
of the less-expensive chronoscopes. The motor and pick-up 
units can be bought for about twenty-five dollars and may 
serve a variety of other purposes in the laboratory, such as 
those mentioned in the first part of this note. 


(Manuscript received June 6, 1931) 














A FLEXIBLE APPARATUS FOR RECORDING 
READING REACTIONS 


BY MILES A. TINKER 


University of Minnesota 


This apparatus for exposing stimuli and recording re- 
sponses is the same general scheme as described by Dodge ! 
and Miles,? but with modifications in certain units which (1) 
permit wider application of the apparatus to problems involv- 
ing visual perception; (2) increase the accuracy and ease of 
scoring records; and (3) reduce accompanying noises. A 
complete description of the Dodge apparatus, as modified by 
Miles, has been given by Tinker.’ 

In the present apparatus the exposure device was con- 
structed almost entirely of aluminum and contains a window 
1; by 4 inches. (A still larger window may be used.) This 
is large enough to expose paragraphs 30 words in length when 
printed in typewriter script or in 12 point type. Hence it is 
possible to expose such printed stimuli as single words, sen- 
tences, paragraphs, mental and scholastic test items, number 
series, formulae, geometric forms, and the like. The key for 
breaking the circuit to make the exposure may be manipu- 
lated by either the experimenter or the subject. Furthermore 
the response may be made by speaking into the voice key or 
by pressing a break key. Incase the subject is employing his 
hands to facilitate computation (using pencil and paper) of 
the problem exposed in the window, the experimenter can 
break the circuit to record a response simultaneously with the 
writing of the response by the subject. 

The timing unit is a Zimmerman vibrating metal bar which 
makes and breaks the time circuit by means of a mercury-dip 


1R. Dodge and F. G. Benedict, Psychological effects of alcohol, Carnegie Institute 
of Wash. Publ., 1915, No. 232, 90-108. 

2W.R. Miles, Alcohol and human efficiency, Carnegie Institute of Wash. Publ., 
1924, No. 333, 87-89. 

3M. A. Tinker, Reading reactions for mathematical formule, J. Exper. Psychol., 


1926, 9, 444-467. 
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contact. This timer may be adjusted for producing 5, 10, or 
25 breaks per second. In reactions to words it is necessary to 
use a unit of one-fifth or one-tenth second to assure accurate 
and easy reading of the records, especially if the chronograph 
motor varies in speed. Such records may be read to one- 
twentieth second with a high degree of accuracy. For longer 
reaction times (reading paragraphs) where time intervals of 
3 or of one second are desired, a metronome or a time clock 
may be used in the time line circuit. 

The recording unit of the apparatus is a Gaertner tape 
recording chronograph carrying two electro-magnetic markers. 
These markers (as used for time line) respond accurately with 
either 5 or 10 excursions per second. With two or more ex- 
cursions per second, however, they become decidedly noisy 
which tends to become disturbing. ‘To eliminate this noise 
a small frame was attached to the covering of the magnets. 
One end of this frame projected out, parallel to the marker 
arm. On this projection was a set screw with a soft rubber 
tip. With proper adjustment of the screw the rubber tip 
serves as a bumper which prevents the marker arm from click- 
ing against the magnet. This reduces the excursion of the 
marker slightly, but eliminates the noise to such a degree that 
it is not noticed above the low hum of the chronograph motor. 
Any kymograph with a spiral drive may be substituted for the 
Gaertner chronograph. 


(Manuscript received April 1, 1932) 
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